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TITLE: Improved proteases and methods for producing them 
FIELD OF INVENTION 

T., e PreS e„ t .nve„c^rc s rr; r " de9radauon and,or 

. terminal amino acid JJ£ Z^T mB , Pro ' eaSeS ^ ' h8m * fr 

further provides the rosultZ^ """"^"p* <" a " «Wng C-terminus. The invention 
oes (he resulttng proteases comprising such amino add extensions 

invention fudherlT^ ,o nud 7 ""°** SUCh "» 

nucteic aoid m^Z^^l?,™*"* ^ ^ "* »™ 
within an^etfeed ™*ods for producing and using *. pleases. thpanJcutar 

15 BACKGROUND 

Polypeptides having protease activity, or proteases are 
peptidases, proteinases peptide hydro™— Pleases, are aoroef roes also designated 

exo-type M hydrolyses Z ^ TT ' ^ anZymeS ' P *"'— "»» "° * »» 

subctassas thereof) The EC ni 7 , ^ 9r ° UP ( " >C """ n9 each ° f ,he lhirt <*>" 

25 Acal)emi0 p££.^L TT '° En2Vme *»"«'"» 1 « *°m NC-IUBMB. 

Pdypepfide rosisten, ^^Z^^^ ** — - 

non-po,ar Otem,,na, «. of sec^^T " 3 ^ " "* 



10423.01 0-DK 



10 



15 



20 



25 



30 



Another disclosure reported, that proline residues at the C-terminus of nascent 
^peptide chains Induce degradation of the polypeptide (2002. Prolin residues at the C ten^inus 
of nascent chains induce SsrA ta gg in 9 during translation termination. J.Biol.Chem 277 338^5 

SUMMARY OF THE INVENTION 

tha, J'"", T"™" Problem a " °' expressing "avm 8 pmteolytic activity 

^ "* "I**"* 3re * «— * »<* -ay be subiec. fo autoproteolylb or 

*eymay be targeted for degraded by other pleases a, re ady duhng „»,rproduIn ~ 

Tb n it : 9reat ln,erest ,o pmue «» « 

stability, so that they may be produced in higher yields. 

The present inventors provide herein proteolytic polypeptides of the S2A and/or S1E 

r^tii Tr ,no t of 0,6 °" ,erminus ° f *• p ° ,ypep,kie - 

least three non-pdar or uncharged amino acid residues may be achieved by adding one or more 
an.no aa d(s, as a fusion-M to the po,ypepl«e. for Insfance by mod«ying the enccTg 

yTe^rr ^t"" 8 °' °' ,he — « much imprcved 

riur^T ^ ? ^ °' POlyPaP " daS ,ha ' * n0t "~ *~ amino acid 

configurabons, under otherwise identical conditions of production 

Accordingly, in a first aspect the invention relates to a secreted polypeptide whion „» 

zzzz r^r** enaopep,it,aaa a ^ ~° 

e^n ,,k T OT 9ad ^ "*° aCldS W " h ' n "» *» aml "° adds of the c 
terminus of the polypeptide, and which polypeptide: 

(a) comprises an amino acid sequence which is at least 70%. or preferably 75% 80% 
85%. 86%. 87%. 88%. 69%. 00%. 9,%. 92%. 93%. 94%. 95%. 96%. 97%. 98%. or 

r S Eo e ,n"^ t0 ,r amin0 "* SeqUenCe " *" «-« 0f *• P»*»»»«. Shown 

in SEQ ID NO: 28: SEQ ID NO: 33: SEQ ID NO: 37: SEQ ID NO: 41: or SEQ ID NO' 
43; 

(b) comprises an amino acid sequence which is at least 70%. or preferably 75%, 80%, 

■ 85% 86*. 87 %, 88 o /o , 89 o /0) 90%> 91% g2% 93%> ^ ge% g7% ^ 

/o .dentical to the amino acid sequence of the the mature part of the polypeptide 
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encoded by the polynucleotide in SEQ ID NO: 1; SEQ ID NO: 2; SEQ ID NO- 25- SEQ 
ID NO: 31 ; SEQ ID NO: 32; SEQ ,D NO: 36; or SEQ ID NO- 40- 
(c) .s-encoded by a nucleic acid sequence which hybridizes under 'very .ow. low medium- 
.ow, medium,, medium-high, high, or very high stringency conditions with' 
(I) a polynucleotide encoding the mature part of a protease, said 
polynucleotide obtainable from genomic DNA from Nocardiopsis 
dassonviliei subsp. dassonviliei DSM 43235 by use of primers SEQ ID 
NOs: 26 and 27; from Nocardiopsis Alba DSM 15647 by use of primers 
SEQ ID NO's: 34 and 35; from Nocardiopsis prasina DSM 15648 by use of 
primers SEQ ID NO's: 38 and 39; or from Nocardiopsis prasina DSM 15649 
by use of primers SEQ ID NO's: 42 and 39- 

"" ! , ;^ ucle « e * <*Q <0 NO: of SEQ ID NO: 2; C SEQ ,D NO: 25- of ' 
SEQ 1DNO: 3, : of SEQ ID NO: 32: of SEQ ,D NO: 36: or of SEQ !D NO- 
40; 

(HI) ^ q uenc e of(l)or(IOofa, l easl 5 OOnudeo«dss.p re ferably400.300 
200, or 100 nucleotides, or 

(IV) a complementary strand of (I), (||), or (III)- 

the an™, acd sequence of SEQ ID NO: 28; SEQ ID NO: 33; SEQ ID NO- 37- SEQ ID 

insert T Q 10 ^ "~ 3 SUbS,aU "° n ' <**»■ ^ 
insertion of one or more amino acids; 

(e) is an allelic variant of (a), (b), (c), or (d); or 

(f) is a fragment of (a), <b), (c), (d), or (e). 

Preferably the polypeptide belongs to the S2A, or the S1E peptidase families 

po jx^zzzzr re,ates to an iso,ated po, ~ — - 

Still, in a third aspect, the invention relates to a recomhin^* 

de^the eldaTeT^ '° ' ' eCmU ™ ^ - ™* « as - 

espect, or an expression vector or po^nucreotlde oonelrucf as denned ,„ „ 

In a fifth aspect, the Invention also relates to a transgenic olant or ntont ™» ~ 
polynucleotide as dafined in the second „,„..,, corepnsmg a 

cons.ruo.as defined in thJ^pT^ * " " * 



10423.01 0-OK 



10 



15 



20 



25 



30 



or eJlTtZT ' he . inVen,i0n to 8 ,ranS9B *' -** « * 

or dements thereof, compnsmg a polynucleotide as defined in the second aspect or an 
exp,ess ro nve Ctow p„ lynuc , eotl(teconstructasdefinedintfteihirdaspec( — -P-. °r en 

rjeZ 0 ™ 1 rate,es 'v me ^ for ^ * - 

riefinaH . ^ , P ' the method ^Posing: (a) cultivating a recombinant host cell as 

pot^p.1 ^ ^ °° mPnS,n9 ^ ~ ide ' - C) -covering the 

^jrsr^r ,o: an anima ' feed ad * re * ~ ~ 

(a) at least one fat-soluble vitamin, and/or 



(b) at least one water-soluble vitamin, and/or 

(c) at least one trace mineral; 



co m „,v n T" ,ee " C ° mP ° S " i0n hav *"> 8 <**>* P^'sin oontent of 50 to 800 ota and 

rzr rr e ~ de ■- — h - - — - - - 

with a, IT^'T °° mPriSin9 " ^ ^ P °' ypepWe 35 *»"'• •» •* aspect together 
«h a, eaa one pother enzyme seiected from amongs, phytase (EC 3.1 .3.8 or 3.1.3 28 ;X S e 
(EC 3.2.1.8); galactanase (EC 3.2.1 .89); atpha-galacfosldase (EC 3.2.1 .22); p ro ,aase (EC 3 7> 
Phosphofipase A1 (EC 3.1.1.32); phospholipase A2 (EC 3.1.1.4); ^J^ZZ^t 

ec « ; r 0 <3 ' 1A3,: *~* c *~ ° <ec " ^^^oZ^f- 

value of an an,mal feed, for increasing digestible and/or soluble protein in animal dk*> 

™rret e a ~^ 

DETAILED DESCRIPTION OF THE INVENTION 

Sedne IT^ZT^ °" °™* -** fc me<:hanlsm hto <"* ""'owing gmups- 

Sa ne pleases (S). Cysteine p rote ases (C). Aspadio proteases (A). M e«a«op r o,eases (M, and 
Unknown, or as ye. unclassified, proteases (U). see Handbook of Prcleolylic Enzymes 
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feur of Jean, £ SB SC I d S eTT "™ S ~ " ** * 

<S A) can, witNn „ hIch ^ have _ ^ ^ J^ta * ^ 

al 2002 MEROPS- . , ^ b, ° l09) ' : " * deSOibeii in Ra »«"8S. N.D. at 

. 2002, MEROPS. Tha proteasa databasa. Nudaic Acids Raa. 30:343-346 Tha MEROPS 

* 5l,ieoas 51E Proteases {Ravi ngs ND Barrett A l Homte ~i . 
of peptidases, Biochem. J. 290 205-218) ( * Ev0 ' Ut,0nar y fami,ies 

— rrz^~r— ----- — 

(a) proteases belonging to the EC 3.4.-.- enzyme group- 

(b) Serine proteases belonging to the S group of the above Handbook- 
(ci ) Senne proteases of peptidase family S2A- 

determination can be earned out fnr an «■ t ucn 

earned out for all types of proteases, be it naturally occurring or wild-tvoe 
proteases; or genetically engineered or synthetic proteases ^ 
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5 define pna,ease ac^For h " * * *" °'" hi * «« *• — <° 

pmfemad aL y P"n»sasc, t h b invention. ,ha sc-cated pNA Assay is a 

a^t,":~:r: o,i9 ' no,the ~ ° ,,he *~» 

proteases obtain!^ not or wild-type 

proteases, and cons l^Z ea^s -T* "* " ' 

is deneraily known in ,he ad « bv^ rt ?** """^ P-pared as 

j own in tne art, eg by Site-directed Mutagenesis bv PCR n.«in„ = , 

cantoning «ha desired mutation as cna of tha pdmars f„ ma PGR !^ 9rnen ' 
Mutagenesis. Tha preoaialion „f ,.„„ reactions), or by Random 

« ^inedWaausTbarnlrT^' 8 ^^^^ 897 ^^^ 
encoded by tbe nolta ^ ^ 3^™" "™ """ ~ >~ 
acid seguence mam the source 1 p^antTa^ - ^ " * 8 «*» -** ■» ""deic 
extracellulady. " 8 P ' e,erreti e * nb °<«™«. the polypeptide is secreted 

- -educed B .Z2£££ I 6 Pf0,eaSe 15 9 — * **- 'o invoke a 

immuncogica, * ^ **■*« The ,™ 

exposed ,oTe~ C^t^T T "* reaC "°" '* «""» «*- 

incased ieveis of ig E "n ^^iTTT' " " ** « *» 

techniques known T7 L ° W - a " er9e " iC ^ ■» prepared using 

With poiyme. may invo La *, 2cZ ZTJT T ' mmUn0l09fca ' 

in wo ga/ugag. wo MW^SStS wTT ^ ** as dMcrtb «' 

ar anernaavalv *— ,' ., . ° n0,0r WO 9W00489 - C°"i"»ation may in addition 

be achieved by genetic Z ^ZJZ^^'^ * "» """^ ^ «*■«*" ~» 
■ cansensus sequences enoZn V 1 *» «— ■ 

providing low-ailergenic vana.IV . ^ "* '* W ° °°' 26354 - Another <* 

Protease aa as to JZZ£ZZZT** "~ ' h ° 
Shield .he epitopes af other ^ ""-""S^ 6 ™' «"«*« •* P—» ™nome re may 
aiigomam. SuJ pn^aC ™ rS ^ ,hereby U,e antigenic af the 

"ducts end their pmparation is descnbed e.g. in WO 96/16177. Epitopes 



in EP 561 907. Once an epitope hes been idem* Ji » deSCribed 

euch es site dieted mutagenesis (see e , WO ^ZST man ' PUla,i ° n «•*■*■»- 

and/or conjugation of a ooh™,, m e e.g. wo 00/26230, WO 00/26354 and/or WO 00/22103) 

<o shie,d me eZe * ^ « to - * .he ^ 

~; z:x::~r ~ -r - — 

non-polar or uncharqed oolar ami™ •„ •«.. Po>ypept,de comprises at least three 

43 . J ' » b£ Q ID NO: 41 ; or SEQ ID NO: 

(b) camp** an amino add sequence which is at leas, 70%. or preferably 75* an* 
85%. 86%. 67%. 86%. 89%. 90%. 91%. 92%. 93%. 94% 95%^% 97V 98* 

3 po, y" ucte otide encoding the mature part of a protease said 
polynucleotide obtainable from genomic DNA from Afocanffopsfe 
dassonjmeisubsp. dassonvillei DSM 43235 by use of primers SEQ ,D 

SEQ : ,D 6 NO d ^ fr0m DSM 15647 by use of primers 

SEQ ID NOs: 34 and 35; from Nocartiiopsis prssina DSM 15648 by use of 
pnmers SEQ ,D NO*: 38 and 39; or from Noc^opsls prsslne DSM 15649 
by use of primers SEQ ID NO's: 42 and 39- 
00 the poiymideotlde of SEQ ID NO: 1 ; of SEQ ID NO: 2; of SEQ ID NO- 25- of 
SEQ ,D NO: 31 ; of SEQ ,D NO: 32; of SEQ ID NO: 36; or of SEQ ^D NO 

("0 - ub ^ence of (I) or („) of at .east 500 nucleotides, prefe.bly 400, 300 
200, or 1 00 nucleotides, or 
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(IV) a complementary strand of (I), (II), or (III)- 

^ Z2T 3 T UrS ^ 15 3 rarian ' ° ,We maiure ^ " 1,16 "-*0 
Ln T « 01 550,0 N0 * SE ° » NO: 33; SEQ ,D NO: 37; SEQ ID 

NO: 4, ; or SEQ ,D NO: 43: cousins a subs^on. dete 6 on. extension. and,or 
insertion of one or more amino adds; 

(e) is an allelic variant of (a), (b). (o), or (d); or 

(f) is a fragment of (a), (b), (c). (d), or (e). 

. Fo ' fte W°^o''hepresentinven a on,thede 9 reeofiden«tybetweent W oaminoacid 
stances, as well as the degree of ,den % behveen «o nucleotide sequences. Is deZned by 
10 to program -align" whid, is a Needleman-Wunsoh alignment (I.e. a glo a, alignment C 
pregram ,s used for a»gnmen, of polypeptide, as we., as nucleo.de sequences ThToeteun 
scoring ma,r„ BLOSUM50 is used for poiypeplide allgnmenU. and the default Kendal is 
r ZZTTT* " - - * - *P * -« * poTypepZ 

(1988), reproved Toote for Biological Sequence Analysis". PNAS 85:2444-2448 andW R 
Pearson ( ggo, -Rapid and Senate Sequence Comparison „,«, PASTP and FASTA,^Lds 
n En^ymology ,83:63-98,. PASTA protein atignments use Smitn-Waterman algoridT^o 

Clusia, reT^r ° f k ' er "" y be,Ween amln ° ^ 8 " qUenCe8 ma " atea •» <■—*«• by me 
TnK,T'°" nS ' 1989 ' C * 8,0e 5: 151 " ,53) «» "SERGENE™ MEGALIGN™ 

« P^remlreT T ' ^ *"> "* " ***** ^ ^ ,ha Si— 

n^a? " • ^ tta9 ° na,S=5 - ThS da9rea - « *» 

descnbed above vr.tr, (he foiling se«ings: Gap penalty of ,0. and gap length penalty of ,0 
Pa,re,se altgnmen. parameters are K.uple-3. gap pena,«y=3 and windows-20 

nolvn r«^° naOfa,ean ^ in9P0, ^ UdeO,ide ^ nces °'^ i -^ 

ant, andT TT"*" ' ^ " "™ «*» «* *"» *• 

eZtl T T ^ ' eTOinUS '° ami "° a ^ • •" one 

revues or ! fT^"" - " " ■* " a < laaa < «• — » «*• 

restdues. or a, leas. ,2 5 amino add residues, or a. teas. ,50 amino add residues, or a. teas, 160 
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same r^^^^^^^^^^^'on-ofageneoccupy^me 
same chromosome, locus . ^ ^ ^ m m 

within popu,a«ons. eene mutations ran to silen , (TO chanse in ^ ^ 
pjrypepbde) or may encode polypeptides having altered amino acid sequences. An allelic variant 
of a polypeptide is a polypeptide encoded by an allelic variant of a gene 

The present invention also rotates to isolated polypeptides ha Ving acMy ^ 

^a-^edbynu*,oac i dae qi *n OMWnichnylJrt<feeunder ^^ ^J^ToT 
m^ium. medium, madium-high. h«h. c, very high salng^cy conditions L a nu^o aX^a 
«*ch hybndizes under me same condntons with <„ a polynucteotide encoding a protease 
obtainable from genomic DNA mom «ocan,/opsis dasson«e/ eubsp. dassonW JoSM ^3235 by 

SBqT^T ' D N0 ' S: ^ ** 2?;,rom ,5647 byus^Zrs 

Zd 3^, t 3S ««• W « - Pttmers SEQ ,D NO* 

nucle,cac,d probe ,s selected from amongs, me nucleic acid sequences of (a) (b) »£ZL A 

r£SE r~ 9 to 0,6 ma,ure >** «*• - 

pre^^ 

3, SEOu 9 IT, ^r"" 065 * SE ° 10 N ° : 1 ' SE ° ID NO: 2 - SEQ '° •« 25. SEQ ,D NO 
amino h ' D M ' ° r SEQ 10 N ° : 40 ' " 3 Subs ^-» •"-«* as we» as me 

SEQ ID NO. 43. or a fragment thereof, and even a genomic polynucleotide encoding a protease 
obtainable frorn genomic DNA from Nocardtopsls dassonvlllel subsp. dassonvl/lel DSM 43235 by 

EQ^s^^ 
33 and ,o , "ocansopsrspnasina DSM ,5548 by use of primers SEQ ID NOV 

38 and 39; or from tVoean*^ prBslne DSM 15649 by use of primers SEQ ID NOs- 42 and 39 
or a subsequence thereof, may be used to design a nucleic acid probe to identify and' clone DNA 
encoding polypeptides ha*n fl pnotease a*v*y from strains ofdlrTaren, genera 
according to methods we,, Known ,„ the art In particular, such pnobes can be used for 
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rrrr me genomic ~ cdna " me ^ - ° f **»*■ 

a p u^,r: - — « shouia bG a ?L 15 e ™ 

corresponding gene (for example, with -p. »H. »S. biotln, or avidin). Suoh probes are 
encompassed by the present Invention. 

Thus, a genomio DNA or oDNA library prepared from such other organisms may be 
screened for DNA that hybridizes with the probes described above and which encodes a 

L pro,ease ao ° u " y - Genomic OT other 0NA from su * ™» * 

sepa atad by agarose or polyacxyiamide ge, electrophone. or 0 , he r separata techniques DNA 

' ZZ^T B or 7"" DNA may bs * n-mi to 3nd « - 

D NO. , or a subsequence thereof, the earner materia, is used in a Southern Mot For purposes of 
2 7 n, ,' 0Ven,,0n - «— «- •» "ueteie acid sequent hybridizes*, a 

lebeied nudeto acid probe corresponding to the nucleic add sequence shown in SEQ ID NO - 1 its 

^^^.«aaubseq U en« m e re of. 1 ^v^ l c Wto varyh^smn g er^ ' 
colons. Molecules to which the nucteic acid pn.be hybridizes under these conditions are 
detected using X-ray film. 

20 eonnr '° n9 H T*" ° f 31 ' eSSt 1 00 nUde °" deS len8,h ' Ve * low «° «* high stringency 

conditas are defined as prehybridlzation and hybridization at 42"C in 5X SSPE. 0.3% SDS 200 

WM sheared and denatured saimon sperm DNA. and either 25% formamide for very tow and low 

hL n8 T S ' 3 h 5% h ' 0rmamlde ' 0r ""*"" «*"**» ^ngendas, or 50% formamide fer 
h.gh and very h,gh stringencies, fotlowing standard Southern bloding procedures 

th r ^ l0ng i . P , r ° beS °' 31 ' 33S ' 100 nUOle ° tideS ,he ™ te *» "» Anally washed 

three bmes each for ,5 minutes using 0.2 x SSC. 0.2% SDS. 20% formamide preferably a. teas, a, 

eas^cTmtTT ^ * *** 3 ' ^ °°™ "™ * 

leas. a. 55 C (merta, stnngency). more preferably a. teas, at 60-C (medium-high smngenoy), 

30 ZZZT ' al ' 3aS,3,M - C(N9h ---'- d ^^3,,aa«a. 9 W cU 

For short probes about 15 nucleotides to about 70 nucleuses in tength. stringency 

Tox bZl 7 Prehybrid,2a,IOn ' - washing Pos.-hybridiza.ion a, 5X to 

10 C b^owtha cateuleted T. using .he calculation accruing ,d Bolton and McCarthy (1062, 
Proceedmos of the Nafir> n ~i a , * ~ . y v 1 



35 



D _ _„ af w ««w,iauwnu.iiy iu oouon ana McCarthy (1962. 

PH 7.6. 6 mM EDTA. 0.5% NP-40. IX Denhardfs soiuta. 1 mM sodium pyrophosphate. 1 mM 
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• u Sl „ g ex ssc a, s-c „ ,o-c ^ T """^ "* ^ Mch * 15 mlnutea 

a m ,„ 0 : ;«;i:rr: a *: chan9es are ° ,a minw — - * 

extent, such M OT L,„"!.l , 1 *™" ami "°- « «»»*«lM-n*« 

residues; or a smaTZ" IT ' e8idUe: 3 ™* " " P '° — *«* 

function, sucn as ^ZnaT T Wnf0a,i0n — * « 

, ^ y d " 1e " raC, ' a " ar " i » nit!e Ptope or a binding domain 

(^tneandaspLpl) Z^ ' 9Uten1 ° ° a * a8 ^° P°*^no acids 

amino adds (pher^Ta 1«1T !T ' S ° leUdTO ^ "** 

sedna. mreole Z^^S^T amln ° 

Ala^er, Val/lle, Asp/Glu Thr/Ser Ala/G „ """^ OC0Urrin9 «*»"9» are 

-n^rr,:;^ 

embodiment, tne residua, active „ a, ieast 60%. 70*. 80% or a, ,™ 

pro,e^r,r P ^r;:r2 ■sr* ° f - invanta * « a - ~ » • 

protease idedved £ Z ^ * " ^ ~'™<*a 

**l /ocentene*. ST^ZT ^ ^ 

or „ps,a «aseo n « i^^T' " ****** 

"«?/, ror example Nocardiopsis dassonvillei DSM 43235. 
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Garrtfy & J. G. Holt in Bergey-s Manual of SystemaBc Bacteriology. 2001 second edM™, , 7 
1. David R. Bona, Richard W. Caslenhofc. 0 "' V< * Ume 

5 tha JIT "r^ 00 " ' ha ' ,0f *• ^mentioned apaolas. the Invention encompasses both 
5 tha perfect and imperfect states, and other taxonomic equivalents e o anamomh.TT . 

identity of appropriate equivalents «wgnizeine 

nTr n W ° Care " W dessonvffl* DSM 43235 is publicly avallabla from 

incl„dinn U I em10re ' be — obtained from othar sourcas 

M*. micrcorganisms Isolated from nature (e.g.. so,,, composts, water. e ,o, using tha above- 

has ^7, , " m,0roOr9anlsm - «*» * sequence encoding a polypapbde 

has bean detected *th tha prob*,. «he sequence may be isolated or doaed by ulnT 
ta^ouas which are Known to those of ordinary sKil, In the arf (sea. a.o.. SambL Z, 1989 . 

« *. T T a " P °'" JeP " de * 3 <*»»«**> «-**•"• essentially free of 

« other polypepbdes. a.o.. a, laas, about 20% pure, preferably a. teas, abou,40% pure moT ' 

«M» about 60% pure, even more preferably abou, 30% pure, most ^bly abl 0 % 
pure, and even most preferably about 95% pure, as determined by SDS-PAGE 

fusad ?T enC<X,eCl " V nUdel ° SaqUenCaS ° ,,he * =lso include 

fused pdypepfdes or daavable fusion peptides in which another polypepBda is fuWtoe 

. N-tenninus G-terminus of the polypeptide or fragment thereof. A fused polypeptide 

produced by fusing a nudelc add sequence (or a podion thereof, encoding JLJTZli. ,o 

polypeptides so ma, they are ,n ftame and thai expression of the fused polypeptide la under 
control of the same promoters) and terminator. f yp P e is under 
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A preferred embodiment releates to a polypeptide of the first aspect which is an artificial 
variant of a wildtype polypeptide said variant having one or more amino-acid(s) added to the C 
terminus as compared to the wildtype; preferably the one or more added amino acid(s) is (are) 
non-polar or uncharged; even more preferably the one or more added amino acid(s) is one or 
more of Q, S, V, A, or P. 

Another preferred embodiment relates to a polypeptide of the first aspect, wherein the one 
or more added amino acids are selected from the group consisting of. QSHVQSAP QSAP QP 
TL, TT, QL, TP, LP, Tl, IQ, QP, P|, LT, TQ. IT, QQ, and PQ. 

The inventors determined, that the polypeptides of the present Invention were produced in 
even greater yields when they were expressed as mature proteases fused to a heterologous pn> 
region, as shown in the examples below. 

Accordingly, a preferred embodiment relates to the polypeptide according to the firet 
aspect which comprises a heterologous pro-region from a different protease; preferably the pro- 
reg,on is derived from an S2A or S1 E protease, and most preferably it is at least 70% identical, or 
preferably 75%, 80%, 85%. 86%. 87%, 88%, 89%, 90%. 91%. 92%, 93%, 94%, 95%, 96%. 97% 
98%, or 99%, identical to the pro-region shown in position -166 to -1 of SEQ ID NO: 28 in position 
-166 to -1 of SEQ «D NO: 30. in position -167 to -1 of SEQ ID NO: 33, in position -165 to -1 of SEQ 
ID NO: 37, in position -165 to -1 of SEQ ID NO: 41 , or in position -165 to -1 of SEQ ID NO: 43. 

When the particular C-teminal amino acid configuration of the polypeptide of the invention 
was combined with an heterologous secretion signal peptide fused to the N-terminal part of the 
polypeptide of the invention, a synergy was achieved and a greater yield resulted. 

Accordingly, a preferred embodiment of the invention relates to the polypeptide of the first 
aspect which comprises a heterologous secretion signal-peptide which is cleaved from the 
polypeptide when the polypeptide is secreted, preferably the heterologous secretion signal peptide 
.s denved from a heterologous protease; preferably the heterologous secretion signal peptide 
comprises an amino acid sequence having a sequence identity of at least 70%, or preferably 75% 
80%. 85%. 86%. 87%, 88%, 89%. 90%. 91 %, 92°/o. 93%, 94%, 95%, 96%, 97%, 98%, or 99%, 
with the amino acid sequence encoded by polynucleotides 1 - 81 of SEQ ID NO: 2. 

Nucleic Acid Seauenres 

The present invention also relates to isolated nucleic acid sequences that encode a 
polypeptide of the present invention. Particular nucleic acid sequences of the invention are the 
polynucleotides of SEQ ID NO: 1, SEQ ID NO: 2, SEQ ID NO: 25. SEQ ID NO- 31 SEQ ID NO- 
32, SEQ ID NO: 36, and SEQ ID NO: 40. Another particular nucleic acid sequence" of the invention 
•s the sequence, preferably the mature polypeptide encoding region thereof, which is obtainable 
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*. -no add sequence JZT^Z VST?** 3 "»«** *«* 

SHQ O NO.- 43. which nudeic a . d Cu^ J^ 18B ' " ^ ^ *° ^ ° f 
5 1 by virtue of the degeneracy of the gJLTc^LJ ™ COTCSpo " d " , 9 Pa* of SEQ ID NO: 

subsequences of of the ab Je pZuT^ ^ '~ '° 

protease activity. v*,ch encode polypeptide ^ ^ 

nudeotides ^eTa^rsCnd^oel'dr, 6 ! ^ Se0U<SnCe *" n WhlCh °™ ° r ™» 
» fees, 225 nucleoodas , ^ 3 £ a subsequence coraans a, 

4=0. 500, 53, 600. 700. 300. 000 or C^STET T """ " — 375 ' 

nudeoUde sequences which haV e a deoree „ml ^ ^ ' nVBn,i0n ate ° retotes '° 
SEQ ID NO: 2. SEQ ,D NO- 25 SEQ pTfJo Il c "* * «EQ ,0 NO: ,. . 

NO: 40 o, a, .as, a5% , £ ttTZ, ^ " ^ * « ° ^ * « « i 
« ^techniques used. 0 i s d a te r^ ,97 ' 98 ' Ora,least99% - 

combination thereof. The clonino of lhe nucl I T ^NA. .preparation from cONA. or a 

~ DNA can be effected. e.g by lg L ,T" " " *~ *» *» ~* 

anybody screening „ *VT — -dion (PGR, or 

» cruras, see. e.g., , nnis e(at . I^^t^*"*"* "* ^ ~ 
New vod,. char nuclalc aek) ampii^on 

"gated activated transcription (LAT) and „1 ? 9336 ° haln «-CR). 

ba used. The nuciaio add J^CHT, ——» (NASBA) may 

-tatad organism and thus, for exampT IT ?? * ** * - another or 

encode ragion of the nudeic ac^enl ^ "** « "» W ~ 

The term -isolated nucleic add sequence" as u««, h • . 
which Is essentially free of other nudeic add T * to 3 nU * lc **< se quence 

- - — «* mora P rz : ~ 6 ; 9 -- * - , ab ° u ' 2o% - ~* 

about 80% pure, and most preferably atLjT ^ TO *" pre,erabl >' a t least 

a '-~,Por^^^^^ 

doning prooeduras used in genetic anoint , ^ 03 ob,a,ned b f ^ndard 
natural locatton to a d«erej ^Z^TZLTT ~ —» — - 

exdsion and isolation of a desired nucleic ^ k ^ d0ninS procedures ™y involve 

encoding the polypepUde. Insertion of £££T " mPrtSin9 ""** aCid M ^nce 

— ---ahoa,c.,r^rcrdrr m r~ onof 
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sequence will be replicated. The nucleic add sequence may be of genomic cDNA RNA 
semisynthetic, synthetic origin, or any combinations thereof ' 

For a genera, d^ptonotnuc,^^^^ ZT 

mutagenesis /spp r»,.„ • t mutagenesis or alanine-scanning 

agenesis (see. e.g., Cunningham and Wells. 1989. Science 244- 1081-108^ m *~ . « 

rrTJ?S, 7£Z1 ' ■ J. da Vos 
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r ^rcr.-rzz:-r-r-- 

polypeptide fragment which has protease activity 8 

S5=F==== 

extend , Ude< *"' e P " mers ' Mch <*>n>Plementa,y to opposite stands of the veotor 
extend dunng temperature eydin, by means of Pfu DMA poiymerase. On incorporation Z2 

Nucleic Acid Constn irte 

« « .solated from a naturally occurring gene or whioh has bean modified to 
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«*en .he nudeic add ^£^177"™ - - ~„ ™ 
ooding seance C ma presem ^^ZCr 0 " ^ ** " 3 

* nudeic acid sequence ma, direct* spednes 1 JT, ' ' S defmed - - 

boundaries of the coding ^u^^l!!?? "l-ncatf teprotoln product . ^ 
(p-okaryo.es) or by .ha ATG - * dBtem *»« " y 3 r " >OSOme bM "9 — 

*■» o^en readingframe a„he 3'end "C^TT"*™* "—>«——. 
-0 iim..ad .o. dna, cD NA . ^ ^ « « * no. 

nuoiaic add sequence to r Zr„Z 10 ? i Ir XPreSSi0n ° f M — of fhe 

on the expression vector. T„ 2 ZTta 17" ^ ^ ^ " » 
« recombinant DNA methods are JZZX^T *" "~ 

neoess™^;:::;?,:: is defined here,n to - -~ - « 

oontro, sequence rnat Zal o , ° " "» ^ *«— ■ ^oh 

20 propeptide sequence, promoter sin™- „ ' "O^njteBon sequence, 

minimum. lhe oondo, s ™ 31T T""' " d Ata 
signals. The con.ro, saqZ™ t "TT ' *«* «°> 

nudaic add sequence encoding a pcCC r^ "* 
» oonflguradon in which a comro, s^tt * P "**' " de "" ed - a 

coding sequence * me D^ZZI ^ZT^-^ * ' P "* B ^ to - 
polypeptide. ^^'^'^^noadiraotethaaxprasaionofa 

> sequence contains transcriptional cant! ** ""V*™*' ^ *"™*<* 

Po^dda^apromoJ^^rr^^ 8 ^ 0 "^ 
in .he hos, cal of choice indud „g J^tZlTT" 

*om genes ancomnq e^Ua^t^^T ^ **- 

to the hos. cell. intracellular polyposes either homologous or heterotogous 
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Examples of suitable promote* for dicing the transcription of .he nucleic acid consul 
of .he prase* tnvenbon. especial,, ,„ a bacterial hos. cel,. are ,he protests obtain* f ro m 

TZ^TT™ ~ - — ~ ^ — 'evansucrasa gene 

0*cB). Bac*s «cWom»s alpha-amytoe gene (arnyL). 8a*s stearomermopnte maltcJenic 
amylase gene (a my M) . SeCus amy^uefstfens alpha-amylase gene <amyQ>. L»„s 
Wr*. peniCllinase gene (penP,. Be*s sub„„s xy* and xylB genes and proxaryotic 
be^aarrase gene Wa-Kamarof, e, e... ,978. Proceedings of .he Nabona, Academy 
Soenoes 75: 3727-373,,. as ^, as .he tocpnanoter (DeBoer e, a,.. , 9 S3. Proceed^ of 
K yO,S * nCeSUSA80:21 - 25> - ^™ersaredescnbedin-UseL 

ai TmaZr Sc,en " nc Ame ' fcan - 1 980 ' 24i ^ - * S — oc* a. 

of the nrrT 68 T^""* Pr ° m °** S d " e<:Bn9 «"«"**» of.be nucleic acid consb.cts 

fo, Aspe**, oryzee TAKA amylase. Rh/zomucor m/ehe/ aspartic proteinase. Aspeng/»us „w 
neutr, aipha-amylaee. Aspe^us nfcer add stab* e,ph Mm y,asa. Aspen^s n^or 

piriTZ 9,U °° am " aSe ^ '^' »Pase. Asperse oryzee a.«a«ne 

127' ^ Ph ° SPha,e ' SOmeraSe ' «***» aoetamldasa. and 

lTr,i ,VPSin "" ke Pro,6aSe ,W ° 9S/0 ° 787 >' - NA2-,pi promoter a 

hybnd oNhepromoters bom me genas for Aapa^us nigerneuba, atpha-amylase and 

fheZ "°™* MOS9 Ph0SPha ' e iS ° maraSe '' ""' mU,an, ' a " d ■»»»•"» 
In a yeas, hoe,, useful promoted era obtained bom «,e genas for Sacchammyces 

c™eathTrr s ^^ 

ce^atae alcohol dahydrogenase/glyce^ehyda-^hosphate dehydrogenase (ADH2/GAP, and 
aCZT 7 ^hosphoglycerata W „ase. Omer usabt, pnomoters for yaea, hos,^, 
are described by Romanos e( a/.. ,992. Yeasf 8: 423-488. 

The control sequence may also be a suitable tranacnption terminator sequence a 
sequence recognizee by a hoe, ca,, ,o terminate benscripbon. The terminator eequence is 
operably bnbad to ,ba 3 terminus of,ha nudeic acid sequence encoding ,be po^peptide. Any 

**** h ,UnC0 ° nal h ,he - - " — ^ used b, ,be preZilrboT 
Preferred te,mina.ors for Blamemous fungal hoe. cells are obteined from me genes for 

tTT. ^ amylaSe ' *- 9 ,ucoamy,ase. rW„ua nfdu/ane 

amhrenilate synthase. Asperses nrgeralpha^Krcosidase, and f=uearf.,m oxysportrm bypsln-like 
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Prefer feadars JJLT "*« ln " e *"- 

SaccWn^ ca^vfcfee alpha-factor and SaZT 3_Ph ° Spho9 '>' cerate 

invanHon ^ ™ h,Ch " «* of choice may te used in ^ ^ 

«***» an*™, JaymhT^^rT "*» »W 

alpha^ucasidasa. <, ^" Jm '^^a protaasa, and Aspergillus niger 

into ft. cars saona^v p*C ThaTand rffh ^ ** "» «— 

-ay .nharanOy £Z ^ °' "» " Ud * acW 
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coding sequent The foreign signai pepMe ^ re9 , on ^ 

sequence does no, naturaily con,ain a sign^pada ^ reglon . ^^^1 

i the exp^sad D Le2„ P ,°T HOWeVer ' si = nal ""ding -eoton which direCe 

p re ;~ n ° P ° ^ °~~< « ' - - — ^ — .he 
Effective signal peptide coding regions for bacterial host cells are the ston-l „» n ,iH= h- 

IT FurttT — 1 • (nPfT. nprs. nprM), and M. erLs 

peptide^' ^ 5i9nal PeP " <te CO< " ng reBl ° nS " lamen,0US *"■- "<*• — » ».» signal 

Hum,co/a«so)e nS callulase.andH 1 , mfco , 8tenug , nosalipase eprotemase. 

ce^^ttl^T; ^ ^ are0b, * ed frOT "» senes for Seccharomyces 

cemws,ae alpha-lactor and Seccnaromyces een^ae invertase. Other uaefu. signal peptide 
cod,ng regions am described by Romenos at a/., 1 gg2. supra 

eeguence'porlrar 08 ^ *° * 3 "~' ^ — <* - «*» •* 

sequence posted at h. am.no tenninus of a polypeptide. The resultant polypeptide is known 

•"•">««-» « propeptide for a aymogen in some cases). A *Jg£ t geneZ 

Saccftan^^^aipta.^^^ ^^^^ J 
A^ce//op«hora(rie™op(,»alaccase(WOgs/33836). Proteinase, and 

NO- , Ih T'T emb ° dimenl ' *" «"P"P«*> °°*9 region is nucleotides 1-498 of SEQ ID 
NO. 1 which encode amino adds -16S to -1 of SEQ ID NO- 43 or s>eq id 

potoeoTTh 15 * a ' 9na ' PeP " de ** ProPeP ' ide re0lOnS are - «- «*» terminus of a 

STZT t Pr0P6P,i<,e re9fen " P0 * ned ^ '° ' h9 «*» •«*-. of a polypeptide and 
me signal peptide region* ne „ to |he amino ^ ^ ^ P / P P r 

expres^fC * deS ' rab ' 6 '° °" «*> *>» •» of ma 

systems are those wh«h cause the exp.ssion of me gene to be turned on or off in response to a 
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*^«Mmm^k^m, fmmm ^ mn9t ^ v compound. Regubto™ 

~ n rrr ^ ino,uda ,he ^ ,ac - - * ~ — ■ ■» 

system or GAL1 system may be used. In filamentous fungi the TAKA «.nh a a / 

i 

Expression Vectors 

acid seZZ^T 0 ; ^ 10 reC ° mb,nan ' ~ «W a nudeic 

sialrZ 0 "' 3 Pram ° ,er ' and "a^ona! and translationa, slop 

= ,e VBCto for expresston . In _„ ng ^ expresston ^ ^ ;~ ° - 

The recombinant expression vector mav be anv v^rtnr „ 

The vector may be an autonomous* replicating vaotor, i.e.. a vector which exists as an 
e^oh_a, end*. ,he rep.ioa.ion o, which is independent C chromosome, . g 

IZZZ araC " ro ™ 1 — * « Chromosome, or an ««at chrj 2 T hT 
vector may con«a,n any means for assudng se^pticatfon. Attematively. the vector may be one 
wh-oh when tntroduced into the host ce». is inlegraled into the genome and repiUdToler 
MM. chromosome*, into which „ has been ,„ t e 9ra ,ed. Furthermore, a *££L 0 ^' 
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The vectors ofthepresent.invention preferably contain one or moroo . * 
«n*,c,„ ace ¥traTO , era se,. hyps (hygn^y* phcaphoa^eferase) (^Te 

genome by homologous or nonhomologous recombination. Alternatively the vector mav 

riz number - nucie,c ^ -* - 100 to — — . p»sl 

W base paae. and m os. preferabiy 800 to ,,500 base pa,rs. which are high* honjg^n 
rt» eton»n«s may he any sepuence «ha, ia homoiogous w» „ ^ et Mqu- ^ 

eZT ^ ^ inte9ra " 0nal *"»" ™» ■» - en Jin JT 

encode nucie,c acd sepuences. On the Che, hand. «he vector may be in.epra.ad Into toe 
penomeof.her^.cellbynorWrerno.opouarecornblna.lon grated rnto (he . 

> Fo ' a *°"^'<»«ca.ion.,heve^^^ 

enabhng .he vector to repBcaie autonomously ,„ .he hoa. ceil in queshon LZZZZL* 

PACYC184 perm,tt,ng replication in £ coli, and pUB110, pE194 DTA1060 
• micron odgin of re^. ARS1 . ^ tne of ^ ^ ^ ■ - 2 
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the Nations, Academy of Sciences USA 75- 1433) ' * 0C "*«- <* 

The procedures used to »ga.e the elemenls described above lo consbuc. the re co mhina , 

phospholipaseAl EC 3.1.1.32); phosphoiieaseA2 (EC 1 1 i,u v. .. 

P^Phelip^seC (3.1.4.3); pJpMpaJo K?, 44, t > l ***' WIEC1US » 
EC 3.2.1.6). °spno»pase D (EC 3.1.4.4); end/or bete-glucenase (EC 3.2.1.4 or 

mixutrelf 9 t l 2 rr n ' ay ^ ~ «— * •» one vector, or using a 

* n "" ,es •^^~««-.*-«*»- V l-.d»™ 1 , 

se.ectab.e markers, and different origins of replication. When using only one vector the oenes can 

.he order ,„ which the genes are doned may affect the expression Jet j me 
protetns. The protease may also be expressed as a fusion protein. ,.e. Jthe gene en^na the 
protease has been fused in frame to me gene encoding another profein. Tm JZ££ 
anotherenzymeorafunotionaldomainfromanotheranzyme «nmaybe 

oolvnuc, *r d '" 9 ' i '- ,he ' nVen,ton *• *> a nacombinan, expression vector or 

polynucleobda constntot comprising e po.ynudeo.ide of the invention. 

Host Cells 
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extra^hromosomal vector as described earlier. The term "host cell" 6nmm 

unieeMar mtooo^anlsni. e.g.. a eoKafyofe P ^ " 3 n0n - 

bacteria suob as iZ Z ' °* " l "* ° r 9™ -«— 

embod™,. ,he baderia, hos, ee» is a Ba*a ,en te , Bacfc * wl/ BeltT 



(sea. a,! ££^£T£ * 742 - 75, >' " — 

norne « 1 987 - bourns/ of Bacteriology 169- 5771 -527m Th* 1,^* .. 
be a eukarvote such « a r.«„ u o"i-o^7B). The host cell may 

herein includes ascosporoqenou* « B «t /c.^ ... ed 
baling fo the Fu '' baSidiOSP ° ra 9 aa0aa - »— 

-he fufure, for ine P^osTo^ntZ^ kT^ ° ,aSSi " Ca "° n " »« 

of th K .nvanton, yaast shall be defined as described In Biology and 
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j* (Skinne^ ■ F A " PaSSmWe ' S M " 3nd DaVenport ' RR ~ e ^ s ~ 

Symposium Series No. 9, 1980). 

The yeast host cell may be a ctndfe, Wansenu/a, K/uyve/omyces, Pfc/wa 
Sacc/»a/omyces. Sc/7/zosacc/iaromyces, or Va/Tow/a cell. 

The fungal host cell may be a filamentous fungal cell. "Filamentous fungi" include all 
filamentous forms of the subdivision Eumycota and Oomycota (as defined by Hawksworth et a/ 
1995, supra). The filamentous fungi are characterized by a mycelia. wall composed of chitin 
cellulose g.ucan, chitosan, mannan, and other complex polysaccharides. Vegetative growth' is by 
hypha. elongation and carbon catabolism is obligately aerobic. In contrast, vegetative growth by 
yeasts such as Saccharomyces cerews/ae is by budding of a unicellular thal.us and carbon 
catabolism may be fermentative. 

Examples of filamentous fungal host cell* are cells of species of. but™* limited to, 

. Aspyn*. Fusariuo,. Hum/cota. AW. My oa //opftfflora , Nmmspols , Perdcmum 
Thtelavia, Tolypocladium, or Trichoderma. 

Fungal cells may be transformed by a process involving protoplast formation 
transformation of the protoplasts, and regeneration of the cel. wall in a manner known perse 
Suitable procedures for transformation of Aspergillus host cells are described in EP 238 023 and 
Yelton et a/.. 1984. Proceedings of the National Academy of Sciences USA 81- 1470-1474 
Suable methods for transfonning Fusarium species are described by Malardier et a/.. 1989. Gene 
78: 47-156 and WO 96/00787. Yeast may be transformed using the procedures described by 
Becker and Guarente. in Abelson. J.N. and Simon. M.I.. editors. Guide to Yeast Genetics and 
MCecuiar Biology, Methods in Enzymology, Volume 194. pp 182-187. Academic Press. Inc New 
York, .to et a/.. 1983. Journal of Bacteriology 153: 163; and Hinnen et a/.. 1978. Proceedings of 
we National Academy of Sciences USA 75: 1 920. 

The invention relates to a recombinant host cell comprising a polynucleotide of the 
invention, or an expression vector or polynucleotide construct of the invention. Inapreferred 
embodiment, the recombinant host cell is a Bacillus cell. 



Plants 



The present invention also relates to a transgenic plant, plant part, or plant cel. which has 
been transformed with a nucleic add sequence encoding a polypeptide having protease activity of 
the present invention so as to express and produce the polypeptide in recoverable quantities. The 
polypeptide may be. recovered from the plant or P .ant part Alternatively, the plant or plant part 
conta,n,ng the recombinant polypeptide may be used as such for improving the quality of a food or 
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feed, e.g., improving nutritional va.ue. palatability, and Theological properties or to 
antinutritive factor. properties, or to destroy an 

or ?T onous (a di00,) or monoco *•*»■» <* 

(blue L^pIw " <rfm ° n0C0 ' P ' anlS are ******* as meadow grass 

. Plants (family Brassicacaaa) such Z £T a °° a°tf>aan. and cruciferous 

chiasm are cens^JaZ ^T^t^T' VaCU °' e ' ""^ 
origin, is considered ,o ba a plan, Z ^ "* °* «« 

P-aa, pa^nTZ «T " "~ ""^ '~ ~ "» ^ ^ 

appropriate reoulatan, M „ po,ype P t,de of th e present .nvent,on operably linked with 

useful for Keying ^ ^7*" ""*« -» <~ • — «• "afcer 
sequences ne JssL fo ZTTT PreSS '° n ha * inteoratetl aM ° N * 

onio^zrrr^^^ 

and now me po»pep«de Is desiradTo C~ ^T" T " ^ ^ 
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developmental, stage or tissue SDecifie an w ^ 

or W par, such as seeds or ZT^T ^ " * ~ '° 3 SPB * 

•issues sue as t " *" ^ 

863-678), a seed specific oromlT^ t ( "° * 1994 ' P ' a "< «**• «* 2* 

tarn nee <W„ a, J£ Z<1 9 ' U '*' ^ " albUmi * 

omersaadspa*^^ Za^e^™""^^^^"^"* 

674). or a wound inducible DromnL , h ,t Ganara ' 248: 666- 

-Wac^a^^o^T M ^ P ° te '° ^ Prom0tar <*" a <°'~ 

0 Protaasa^lmCr T "* ya ' S ° * '° 

^on.FcCcaT :^«TT^^' P *^-*''«- 
9ene to enhanoe-exp^on ' ' ^ "» « **» — - -* 1 

The nucleic acid construct is incoroorateH . 4 
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15 



totogy 19: 15-38). However H can also be used fo, transfers monocots aBhouQh Dlh 
■ensfomtatfcn ntetnods are 9 enera,, y preferred W w^nte % 
for penary ^ sgenic monooots js ^ ^ ^ ° ' 

Molecular Biology 21 .415-428. 

Following transformation, the transformants having incorporated tharoin 

seouence encod^ a p^eplWe having protease ^^ZZi" 7 

dairj n p • ° r ^ ^ c ° mp,is,n9 a °* m 

delm 8, or en express veotor or polynucleotide construe, of the invention. 



Animals 



sequence encoding , ma ' °™ ' rans, °™° ««" a nucleic acid 

^ZdtirT haV ' n9 «** ° f ' he * *> as to 
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into fenced eggs, and preferably pennanently integrated Into the chromosome Once th. . 
Academic Press. <*Pr«s,on. J. M. Fernandez and J. p. Hoeffler (eds.). 

Methods nf Prnrti , r fj nn 

20 pdypepHde "^'"^^'n - sepernatem; and optional* <b) recovering, he 

carton and n.trogen sources and inorganic salts, using procedures known in the art SuitabT " 
product, or disappearance of a suh^te Fn , SpeC,fiC ant,b °d«es. formation of a 
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The resulting polypeptide may be recovered by methods known in the art For examn.. 

but not ^ Ration, filtration, extraction, spraying, evaporation, or precipi JoT 
The po.ypept.des of the present invention may be purified by a variety of procedures 

ZTo Z ,n T 9 ' ** " Kmited * CM, ion exchange In 7 

hyd ophob.c, chromatofocusing, and size exclusion), e.ectrophoretic procedures (e.g pTeparaBve 
soe ectnc focusing), different so.ubi.ity (e.g.. ammonium su,ate prestation, SoIp " 

z:z { :z; 9 '' pmtein p ~ j - c - janson and ^ — v CH ^ 



ComPOsittnnQ 



POlypept.de of the present invention. The polypeptide compositions mey be prepared In ■ 
accordance wtth methods Known in the art and may be in the torn, of a iloJVa dry compositton 
The T":T PO, ~ ""^ be " - *"» <* « ~ - a miac^Z 1 

ESSE : be T - in ,he -~- ^ - — - * — — - ~ 

20 Animal Fet=>ri 

The present invention is also directed to methods for using the potypeptides of the 
mvenbon m animal feed, es we,, as to feed compositions and feed addZLpds L the 

Examples ofanrmals are non-nmtinams. and ruminants, such as cows, sheep end hontes ,n a 
; a ^-^™n'^- lm a, fe anon^an,anima,. Nor^inan. JL. InZemonc- 
Sastnc antmals. e.g. pigs or swin. (including, but no, limited to. piglets. growing p„s. and sows 
peuttry such as turireys. ducKs and chWren (inking but not iimited to broiler"! .ayenT 

crustaceans Onctudlng but not limited to shrimps and prawns) 

' 0 » mm ? e ' e T f ^ 0r ' MdCOTPOS " i0nmaaraanyMmwu "*P re W^.^a.o, 
composton suitable for. or intended for intake by an animal 

sim,,, J" aCCOrdln8 10 ^ ,nVenSOn *" """^ ran "* fed «» •*»■ before, after or 

simultaneously with the diet The latter is praferod. 

5 when J." ' " TT emb0d " neW ' ,he » <" - *»» in which i, is added to the feed, or 
when be,ng .nCuded in a feed additive, is weWefined. WeMefined means that me ^otease 
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preparation is at least 50% pure as rf 6 t 0 rmi n ^ u 

12 of WO 01/58275) ,„ leT-H^ * *— **" *-™«og ra pW (see Exampte 
80. 85. 88. 90. 92. I „ Z ZT V ' "T"* " "~ * - ». 70. 
Potion „ advent F Xtr:; *"* P^- 

• Please M b essetl y freeC^ Z£T 1 "~ 

oorrecfly refers in particular to toe l l 7 contammabng other prolaaaaa. Tha tarm dosa 

compcai,^^ Z ,h8P ~°'- » — intomrediato 

subject to a much hlohar batch L. . k ' ' S ° """ y ° blained and <*» 

M a particular embodinC ,h T * ^ Wlh ««S™-- 

solubllislng 'eg « ZI a ' aCC ° rd,n9 *° * hS 15 -P— * 

Exa.ple ,,. ^ 8U " abfe — » ' 0r «***- l«* is nMosad In . 

va 3 atob,a. In«udi„ mc^ed P^r^Z^^T* IT *" 3 

fcn% Fabaceae. e.». soybean, topine. pea. or baan. P ' antS * ' h ° 

In anolhar particular embodimenl, the veoetahle mu. 
mora plan* of the Bmlly Chenopod^a,. £EtELT* T^' ,rom ™ °' 
examples of vegetable protoln sources ere ra^ and c^' Tt " qU ' n ° a 
vagatoble protein source. Other examples o, vegTbto ptC 3 P "*™ d 

bade,, wheat. rye. pa,, malze (com,, L. and so^hum " " 
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Part '°' J ' ar emb0dlmen '' "» P ro '^= 'or use according to iIwention „ 

^ P— » - or digest p^n „ detemined using ^ T ^ ' 

' pro-e ases « 7 ° f % di96S,ed " Pro,el " «***• «•". the 

* f. 5 *'" *"** o-^oodiment. the for use encoding to the inventon is 

102%, 103 A, 104%, , 05 %, 106%. or at least 107%. relative to a blank The DH is de1nm,i„J 
us,n 9 he ntonogastrtc*, ^ m0 de, of Srantple 11. The fe ,,ov*„g are ^Z J^ 

Itlt eC"^ ^ 10 3 ^ ° H * — usingtheeguacu.ure, 

ones., " « ^ 6mb0< " men, <* a *») •«*»«* P™ess of the Invenuon, the pnMeasefe, In 
question ,s affecting (or acting on. or exerting Ke solubiilsin, Muence on, the vegetal Z,„ 

T aqueous solvenl such as and •» pH a " d - os are 

ueaCl ? " 8 " - '° *«**«- ° f *• «"»»» In q ues«on. For example the 
^ment mav take pU.ce a, a pH-value a. which the activity of the ec.ua, protease is a. leaTa, 

ZZ\™L7' 7 :i 80% or at ^ * — * - ~ n 

W £ T ^ 'hoacuvltyof the ecfua, proteese is a. teas, 40%, 50%. 60% 

*> * 60% or a, leas, 90%. The above percentage acMy indicate are **, ,o the max.™ 
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---- f--:::r:rz3^rrzr^ OT 

' ^Co^~t rt r; o,an an,mai feed TOans - 

y« 5DD ,.iiy or the proteins, thereby leading to increased protein extrarti™ f mm ,k w, . 

re,a«ve ,o con.ro. expert™,*, « no protease add iti on ~ 
EEC TT" raU ° <FCR) ^ Wei9h ' Sai " ™* "° « - Scribed in 

— . The fced add lti ve may a,so contoir, a. ,aas. ooe ^2 '"' - '~ " *" 

Further. oplional, feed-additive ingredients are ccfourina aoents » mma ™ 

P*—. Ceding OTfifimgal poiypeS^dZT? ' 
en^rne se.ac.ad front atrtongs, phyt8se (EC M J J^^^ ~ — 

(K (EC 3.2.1.22,; pjL (EC 3 4 . , 
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JTT" A1 (E ° 3 - 1 - ,32 »- A2 (EC 3.1.1.4); tysophosphoiipase (EC 3115, 

Tc 3^ 6 r (3 - 14 - 3): Ph0SPh °" PaSe ° 3 , - 4 - 4,: «""* ^™ 3 ,1.4 of 

5 prote^z^ 

2nnm p.^ • ^ clOTemn . Lactoferncm, and Ovtspmn such as Novispirin (Robert Lehrer 
10 anuria, activl*. CT/DK ° 2/00812 ' aSWe " aSVara " teor ^™*of tt « a bove l ha. I B«a i n 

th» fM H a 1,00 totflefe ed. whereas macro minerals are usually separately added to 

includedto Partl0U ' ar r* 0 *-*- 0,6 animal «- * «» invention la intended for being 

,nc,udad (or prescnbed aa having ,o ba included) in animal diets or laad at lavaia of 0 01 to 7o Z ■ 
» mora particularly 0.06 to 5 0%- or 0 2 to 1 nv m „ , • 0% • 
particular tor premi xes. ' roeanln9 9 ^ 100 ■ feed >- ™* '* » » 

The following ara non-axdusiva lists of examples ofthaaa components- 
vitamin O " a '" S0 ' U '" e ~ V " amln A ' ° 3 ' * mi " E ' and vitomin K. a. g . 

5 B2 JIT- ° f Wa f 1 T" SOll,bte Vi,m,l " S are ^ mi ° 612. biotin and choline, vitamin B1. vitamin 
B2, vrtamrn 86. nracn. fate add and panthothenate. a.g. Ca-O-panthothenate 

cobalt. ^ °" raCe " ~ ma " 9aneSe - ir ° n - *-* — *». and . 

Examples of macro minerals ara caldum, phosphoros and sodium 
The nutotiona. retirements of these components (exemplified with poultry and 
prglets/prgs, am „s.ad in Tabfa A of WO 01/68276. Nutrttoa, ra qulrement Zs J these 
components should be provided in the die, in the concanbations indicated . 

IndivJT 6 a " ema0Ve ' "* anlma ' feCd ° ,,he at least one of tha 

md^duat component, spewed ,„ Table A o, WO 01/68276. A, ,aas. one means either o, one or 
more of. one. or or three, or four and so farm up to al, thirteen, or up fa a» nfteen fad Z I 
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components. More specifically, teis at leas, one MM* component is included in tee additive of 
he invention ,n seen an amount as to provide an in-,eed^noan^on within tee range McL 
in column four, or column five, or column six of Table A. 

The present invention also relates to animal feed compositions. Animal feed composKions 
■ te-afive* hig* content o, pmtain. Poutey and pi g die* can be characZTas 

,n rcated ,n Table B of WO 01,53275. columns 2-3. Fish diets can be characterised as ntfcaL in 

s T ,0 us 091779334 WMCh is hereby ■—»— * 

800 9 ta anZr^ C ° mP ° S "' 0n aCC ° rd "' 9 '° *" ^ 3 ^ «— " ». 

800 9 /kg. and furthermore compnses at leas, one protease as claimed herein 

Furthermore, or In the alternative (to tee crude protein content indicated above, tee animal 
feed compose o„he invention has a content o, metaboiisable aneroy of ,0-30 Mdfiro anc/Ta 
content of cafcfum of 0,-200 g*g; a„d,or a content of ava«ab,e phoXrus o,0, io ^ 

u.1-1 50 g/kg. and/or a content of lysine of 0.5-50 g/kg. 

In particular embodiments, the content of metaboHsab.e energy, crude protein, calcium 
Phosphorus methionine, methionine P ,us cysteine, and/or .ysine is within any one of ^2 3 
4 or 5 in Table BofWO 01/58275 (R. 2-5). 

w&T^T^Tr* (N)mu,Bp " ed 62s - ■* ™° 

Tm Z^ 9 ! I f COnte '" iS delemined by ,he ^ eklahl "-a* (A-O.A.C, 

wlnZrr dS °'~ 14,h --~»'--'^oa l Chemiste 

Metaboiisable energy can be calculated on tea basis of the NRC publication Nutrient 

r: ** ' eViSed ,988 ' *" - - ^ "« °" — on 
enrma, board of agriculture, natfcnai research council. National Academy Press 

£ETT f • ,* 2 ' 6 ' ^ ^ EUroPea " Tabb °' EnaW ^ *» Peed-stuffs, . 

S2TrtTp e ^ ~ aXtenSi ° n ' 7361 DA Tne Nethertands. 

Grafisch bednjf Ponaan & looijan bv. Wageningen. ISBN 90-71463-12-5 ' 

The dietary content of calcium, available phosphorcs and amino acids in complete anima, 
«ete ,s caiculated on tea basis of faed tab.es su* as Vaevoadartabe, 1997. gegevans o er 
oham,s*e samenstetilng. vertaerbaarhaid en voedervraarce van voedermiddelen, Centra. 
Veevoederbureau, Rundenweg 6, 8219 pk Letystad. ISBN 90-72839-13-7 

one vedZrr '«* <=°^'«°« <* «*> invention contains at least 

one vegetable protein or protein source as defined above. 
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0-30% eats; and/o 0^12 7 ; ant "° r °" 7 ° % *"* "** °- 70% «*- 
Anima, rT : and "" 0-1 ° % fBh mea '' 0-20% whey 

minerals are add.d=, rf . ? am ° Un,s ^ essenUal *»*» and 

added .erare a ^ZZ ' I^a * 7"* 3 ^ ^ " *** 

ans. me dosage m mg enzyme pralein par kg feed Is calculated. 
« a "* determinin9 ^ *" « course, 

Deteroent f tompoBitirmo 
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,n 8 "*>*<*■ *• ""»°nUon provides a d.,erg e „, addMve comprising the protease of 

1 ^ 35 " d8toSen ' «"•—• ™» ~ one - 

more other en2ymes suoh ^ another proteaM as a|(a|ine ^ 

galadanase, a xylanese. en oxidase, e.g.. a laocase. and/or a peroxidase 

daemon, T"l ' he , Pr0p9rti9S <* ' -"-M — ■ be compatible wiU, .ha selected 

^ d'i e PH ^ OT ' COmPa " b ^^^ e ^oaadnon^z yraUc , nB ^ lente 
ate), and (he enzyme(s) should be present in effective amounts 

^ * ° rfUn9a ' Chento "* ™<*« or protein 

ngrneared m u,an te are inoiuded. Exampies of useful lipases inoK.de iipaeee from Hu^L 

and EP 305216 or from H. rasotena ae described in WO 96/1 3580. e Pseudomonas lipase e g 

P *>«*>"*™s *• strain SD 705 (WO 95/06720 and WO 
(1993). B,ochem,ca at Ebophysica Aofc, 1 1 31 . 253-360), 8. atea^fhaanppMaa (jp 64/744992) or 

* p r wo 91,16422) - ^ exa "* ies ara ,ipase «••— — - ^- 

WO Z 5 rr 1541 - ^ EP 26 ° 105 - W ° *™ W ° 9 ~- WO 95 307« 

WO 94/25576, WO 95/14763. WO 95/226,5. WO 97/04079 and WO 97/07202. Preferred 

available lipese enzymes indud, Upolasa™and Upolese Ulfra™ (Novozyn.ee A/S) 
Suitable emylasee (alpha- and/or bete-) include those of bacteria, or funga, origin 
Chemically modified or protein engineered mutante are induded. Amylase* include, for example 
alpha-amylases obtarned from 8ac*e, e.g. a specia, aba* of B. fcba^a. described in mcxe 

wo It ■ ■ 839 - °' are •» «— ' — - WO 94/025^ 

WO 94/163 4. WO 96/23673. and WO 97/43424. aspedaily the variants with eubsUtubons ,„ one 
or more of the following posibons: 15,23. 105. 106. 124. 128, 133. 154, 156, 181 188 190 197 
202, 208. 209, 243, 264, 304. 305. 391 . 408. and 444. Commercially available am^ are 
Dummy, . Termamyt™. Fungamy,~and BAN™ (Novates A/S). Ra P ldaee™and Purest 
<Trom Genencor International Inc.). 

Sutoblecelluleses include moseofbacedel or toga, origin. Chemica«y modified or 
£«. e^neared mutenfe are included. SuHable caiutases include celMesas from me genere 

TWavfa. AcmmoM™. e.g. the funga, ceilulases 
Placed from Hum/co/a /nso/eas. Myc*p*bom fbermopMa and PuseL oJeoS 
disdosed ,n US 4.435.307. US 5.648,263. US 5,691,,78, US 5.776,757 and WO 89/09259 
Especially suitable ceMases are d» alkaline or neufra, caiMases having colour cara benents 
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Examples of such celluloses ere celluleses described in BP 0 495257 » 
WO 96/29397. WO 98/08940 Other exam„,~ , , . ' EP M1372 ' WO 96/1126 2. 

WO 94/07998. EP 0 531 ^S^^T^T ^ "* " *«*«• * 

98/12307 end WO 99/0 s« C "■"^ US 5 ' 763 ' 25 ". WO 9572447, . WO 

8 CareryraeCZ-es^S^ 

KAO.500(B, iflJS^ST^ ^ H *" (B — - «- inc.,, and 

~ :rc:^r:~ de r * piam - - <*•*■* 

. Pen^frereC^^tTc J "'"^ E ^ 0f ^ P-0-ases.ndude 
'0 93/24618. WO 95/10602. and WO^/^T' "** ^ * ^ *"*- W ° 
Ouardzyme™^^, 98/15257 ' ^arcially available peroxidases Include 

these enzymes. A l^JZZZZZ??*? * - — «"*"*• - - 

18 addiave. cen be IdrmuleL e o aT ,>,,he , mVen, ' 0,, ' * 8 -<>— additive or a combined 

Canons are ZuZ 3 8 ** » "*™» Jargon, add^ 

"quids, o, slurries ^ * n ° n " dUS '* ,g »<***■ in particular srabfced 

NonKlusBng granulates may be produced en — m. . ... 
4.661.452 and may optionally be oL JT ° Sed US 4 ' 106 ' 991 and 

=0 materials are potyfe ^ " "- * coating 

m, alcoh* in whicLe aioo ot ^ ^ I' '° ? ^ «* "* 

.o 80 etbyiene ox*e units; /any alooj^^ " — « « 

acids. Examples of filnvforming coatino ™T-T ^carides of fatty 

» are given in GB ,483591 ^I^e n . " * *" 

polyo, such as prapyiene g^T TV* ** * -*- d * "** a 

established melhods. Protected enzvn, '. T ' aC "° ** " b °" C 3cc °" li W *> 

EP 236216. roteaede ^™s may be prepared according to the method disclosed in 

■ncluding s^Zin^: T " ^ >° «** 
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alcohol sulfate) aiooho, To^ T ^ ** sulfate (fany 

^e^^olTlZ^TT' nonwphe,K " e,hoxyla ' e ' **<**—*: 

e^e^^^ a ^itrr: carbona,e - ^ nM ~ ^ 

soluble sllcates or layered silicates (e. S . SKS-6 J HoecM, *"*•—*> -* 

The detergent may comprfse one or mora polymers. Examples are 

as JZSZZZl: b 'T ln9 S>Stem ** ^ CCmprise 3 source sue, 
such as telraaceZn^l " " P"**™* "each setter 

Phenyl boronic acid derivative such mi* , „ enVat,Ve< e *- an aromat '<> borate ester, or a 

en:yme protein per ^^ZZ^ZZTT** * 
d^tS^*"""" ^ b6 inC ° ~ in "» — — 
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Th6inVenti0nd ^^ „ 

embodiments herein disclosed u TOpe by the specffic 

■em aisciosed, since these embodiments are intended »i. , e »~e- 
aspects of the invention. Anv eouivaipm « m h h- . intended as lustrations of several 

Mn * equ'vaient embodiments are intended tr» ho „am~ »u 
invention. Indeed varioiiQ mowifi^ r L "uenaea to be within the scope of this 

modifications are also intended to fa,, JTl 7 re0 °' n9 "^P" 0 "- Such 

in e v ;z:r renoes are *" hereH - - - * 

EXAMPLES 

Materials and methods 

Strains: 

M.«MbPLia0l (Didehchsen, B eta.. ,990. Conhg of aldB which encodes 

Bacillus subtilis MB1053 
Bacillus subtilis PL3598-37 
Bacillus subtilis MB1510 

Ctonlng of aldB, which * * <1990 ' 

^^^^..^^^"an^ir^^ 

negaeve cells. The d,srup„on was performed essentially as described in (Eds A L 
Sonenshern, JA Hoch and Richa* Loslc* „ 99 3, BaClus subtills and o he^m- 
Pas,„ve Bacteria, Amehcan Society for microbiotogy, p . 618 >. 

Procedure fef ismaiin^ ggggpjg pfn 

Add'slLrr tebU,ferand,eaVeatroom ^mperature foriomin. 
Add 250 pi NH4Ac and leave at ice for 10 min. 

Add 500 p| CIA and mix. 

Transfer to a microcentrifuge and spin for 10 min. at full speed. 
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Transfer supernatant to a new Eppendorf tube and add. 0 54 volume «o.h • 

thoroughly. 0lUme 00,(1 'sopropanol. Mix 

Spin and wash the DNA pellet with 70 % EtOH. 
Resuspend the genomic DNA in 100 pi TER. 



TE: 
TEL: 

Thiocyanate: 



NH4Ac: 

TER: 

CIA- 



10 mM Tris-HCl, pH 7.4 

1 mM EDTA, pH 8.0 

50 mg/ml Lysozym in TE-buffer 

5M guanidium thiocyanate 

100 mM EDTA 

0.6 % w/v N-laurylsarcosine, sodium salt. 
60 g thiocyanate, 20 ml 0.5 M EDTA. pH 8.0, 20 ml H20 
d.ssolves at 65C. Cool down to RT and add 0.6 g N- 
laurvisarcosine. Add H20 to 1 00 ml and filter it through a 0 2 u 
sterile filter. M 

7.5 M CH3COONH4 

1 Mg/ml Rnase A in TE-buffer 

Chlorofprm/isoamyl alcohol 24:1 



Purification ofPPP bands -,n d DNA seg u^ncjng 

(Pha P ° o R fra9ment ^ ^ PUrifi6d US,n9 GFX ™ PCR DNA Gel Band- Purification Kit 
ne amplified PCR fragments are determined on an ABI PRISM™ 3700 DNA Am i , d • 

). fluorescent labeled term,nators and 5 pmol of the sequencing primer of choice. 

Media 

15 TY: (As described in Ausubel, F M etal {*>ri*\»n..~~ . , 

Wi.eyand Sons. 1995). ' in Mo,ec ^ Biobgy". John 

LB agar: (As described in Ausubel F M Ptai M e \ ^ 

JohnWaeyandSons.,^,. ' ' CuTOnl Mote^ar Boiogy-. 

20 rT G a9ar ' S ^ a98f 3UPP ' emented ^ °' 5% GITOS6 - 00S M P—— Phosphate. P H 



Proteoly tic antiwity 



10423:010-DK 



42 



15 



20 



25 



30 



on*. r teaSe ^ ' S meaSUred USln9 ,he PNA — » »«" ^nyf^anine-alanine- 

PNA assay is descnbed in Rothgeb, T.M., Goodlander, B.D GarnsonPH ,k , ! 

^lof^AmadcanOI, C^s^. Vo,. mm£Z2£T' H * ' 

Gene ayni-resion in B»,-f» „s man, ^ 

»» 9enas in me following examples are integrated by homologous 

on the Sac*s sub»* bos, cei, genome. Tne genes are exprLd I 
con^ of a *h promoter system ,as descdbed in WO 99,43835,. consisting of tha pmmotem 
fmm Sacz/as «en,Ws alpha-amylase gene (am*,, BaciHus amylCiquefLns alpha an^ase 
ana em,0 and „e Sac** „, orfn9fens& ^ promoler indudi „ g / J^TST 
Cbiorampbenioo, aoe«y,- tra „s,e ra se was used as maker. (DascLdinT' 

Example 1 

sp. NRP.L 18262 hawng me amino add sequence shown in SEQ ID NO: 43 (WO 01/58276) was 
conalrucled. which has the nucleotide sequence shown in SEQ ID NO- 1 ™T. 1 7 

resulang , n the ^ ^ fe ^ ^ ^ 

of m,s cornet were mada. Compared to me Sav-IORS protease ancodad by SEQ 0 NC 2 T 
tea vanan. construe, Sav-IORS HVO was conslructad to have 8 amino acids aL in the C 
terminus: QSHVQSAP (SEQ c NO: 3, which wans ancodad by the foliowing DMA sequence 
ex,ena„ninsar,adinfron t of,heTAAstopcodonofSEQID N 0:2: A ■*>"•"» 

(SEQ ID NO: 4): caatcgcatgttcaalccgctcca 

terminus- O^aT^l ^ *" * 4 "* » — ■» - C 

(SEQ ID NO: 6): caatcggctect 



terminus- QPfS^'o^ **" """ U '* d '° ' I ~ 2 a ™° — *> ■» C 
35 ^ sTpccTol ' •* ^ '°"^ n9 ° NA «* — — - »° 
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(SEQ ID NO: 8): caacca 



Tail variant Sav-10RS H V2 was constructed to have one amino acid extra in the C 
(SEQ ID NO: 10): cca 
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recomh- T T ^ ^ ***** 96068 Were by homologous 

-omb-nat-on ,nto the Sac//,us suMls MB1053 host cel. genome. Chloramphenicol LTanT 
-sformants were checked for prptease activity on 1 % skim milk LB-PG agar p,ates 

6 ^ Ch, °~^ *"» P^ase posiSve colonies'were further 

ItTeaTc T ^^^'^^^^--^eneDNAseauence, and five 
stra,n each compnslng one of the above constructs, were seiected and denoted, respectively 

HV 2 anL B SUmS SaV - 1 ° RS HV °- B SUbWiS S3V - 10RS a-^ S ™ 

HV2 and B.subtitts Sav-1 ORS HV3. 

Example 2 

performed'rT 0 " 8 ** Pr ° dUCti ° n ° f *" e ^ mes " *. invenflon were 

Performed on a rotary shaking table in 500 ml baffled Erienmeyer flasks each containing 1 00 m, 
TY supplemented with 6 mg/l chloramphenicol. 

Six Erienmeyer flasks for each of the five 8. scoMs strains from example 1 were 
ermented in paraHel. Two of the six Erienmeyer flasks were incubated at 37'C (250 rpm) two at 

day 1 , 2 and 3 and analyzed for proteolytic activity. The results are shown in tables 1 -3 As it can 
eseen from tables 1 -3, the effect of the tai.s is a surpnsing.y high improvement on the 
express™ .eve, of the protease, as measured by activity in the culture broth. The effect is most 
pronounced at 26'0 and 30'C, but is a.so evident at 37>C as an effect observed espedaZ e 
early stage of the fermentation. especially at the 



30 Table 1 : Relative proteolytic activities at 37'C 

Day 1 I Day 2 



Sav- 



-10RS 
Sav-1 ORS HVO 



Sav-1 ORS HV1 



Sav-1 ORS HV2 
Sav-1 QRS HvT 



1P- 



3.3 



4,7 



£2. 



5^3. 



-L9_ 



P.7 



1A. 
0.6 



1.4 



1.0 



0.8 



12_ 



0.4 
1Z_ 
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Sav-10RS 


Day 1 


Day 2 


Day 3 


OSV-1UK8 MVO 


1.0 
1.7 


1,0 
2.2 


1.0 

2.9 


_Sav-10RS HV1 
Sav-10RS HV2 


4.6 
2.4 


3.1 
1.9 


4.9 
2.3 


Sav-10RS HV3 

Table 3: Relative proteolyti 
Sav-IORS 


4.8 

c activities 
Day 1 


3.0 

5 at 26°C. 
Day 2 


4.4 
Day 3 


_Sav-10RS HVO ~ 
Sav-10RS HV1 i 


1.0 
1.8 
2.5 


1.0 
2.5 
3.6 


1,0 
3.1 
4.3 


Sav-10RS HV2 I 
Sav-IORS HV3 


1.8 

2.6 "7 


2.6 
3.5 


2,8 
4.6 
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Example 3 

encodino ,0 T" 9 C °" S ' rUC ' W!>8 ^ ,<>r ^ """""—n- integration of the tail-vanant 
^277 ST 9 SeQUenCS OMh9 We "- kn0 "" —* BPN'pro.easowas 

Sac*s suMs were tee, to lhe as fen ^ polynudeolitle 

I enCOd,nB «■»*«*■ ^ integraaona, 

denoted PL3598-37 was setecled and confirmed by sequencing to contain the conect construct 

The PL3598-37 clone thus contains the following: 

1. The flanking regions 100% homologous to region of the B.subtilis genome (appears as the 

upstream fragn^ntyfmD-ytmC-yfmB-yfmA-PelstartanHtho^ ♦ fc 
yflS-citS(start)) downstream fragment Pel-end- 

2. The Spectinomycin resistance gene flanked by Resolvase sites (res) 

3. The triple promoter region plus Oy//,A mRNA stabilising leader sequence. 

4. The BPN Open Reading Frame. 



Construction of trirtte-pm^oter rpm> 
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construct T fra9ment inte 9 rational ^sette for a BPN' ,ib rary was 

Bacillus amyL-derived orom^r r k P Pr ° m ° ter * 3 fUSi ° n of an °P timi «* 

» QinyL uenvea promoter (as shown in wn Q^/ino/io. 

scBAN and crylllA where th. tirl • 83,10248. N ° V02 y mes ) wit * two promoters 

99/43835; Novozymes) Suited ■ m RNA-stab.l,s.ng sequence (as described in WO 

#251992 (SEQ ID NO: 12): ^gcat^glogaocgatetagagcggattgaacalgcs 

DMA. The PaC-Spac 1ra9man. was prcaucad by PGR aml^ . 9en0m ' C 
B.sum S stan MB1053. using lha priJT ' * 9en °* ni ° ° NA ,rom ,he 

25 #17 g5 42( SE Q ,D NO: 14): 0W* Wa , WBg- ^ B J^ 
Constructi on of MB1053 

defined by the ends of a third PGR ftjl T w h °™'°90us to DNA sequences 

irssrssssrrs. 
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and it was situated between two Res site* 

-n US paten, Mo. 5.882.688,. CTJTS^ZT " " ^ 

) ir« e amerent PCR fragments were initially produced. 

st-ain PL2306. using ,ne pLars * ram0SOTral ° NA *" *«•» 

#179541 (SEQ ID NO: 13), and 

,0 * 179539 * «-»> - ««« s^c ^ (SEQ , D N0 . 15l ; 

« sfca of .ragman, 2 ,s ^ T^at T " ° aSSe, ' e * mit,< " e °' ^ Pel S e "* The 
#179542 (SEQ ID NO: 14), and 

#1 79540 with overlap to #179153 Spec primer (SEQ ID NO- wy 
ggatccagatctggtaccc^^ 

££™ ' D 17>: 9tt9taaaa ^-9'aaa t .c, 8ate aaa l89 
#1 79153 (SEQ ID NO: 18): cC9Cgtcgacacfagacacgggtacctgatotagato 

Standard conditions for ih» p,- R 



35 



5 pi Buffer 2 

14 pi dNTP's (1.25 mM each) 
2.5 ud 20 pM primer 1 
2.5 pl 20pM primer 2 
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x pi water 

The cycling profile is: 

1 cycle of 120 sec at 94'C 



Break. 



10 cycles of 15 sec at 94'C. 60 sec at 60'C. 240 sec at 72'C 

20 cyc.es of 15 sec at 94'C, 60 sec at 60'C, (180 sec at 72'C add 20 sec pr cycle) 

1 cycle 600 sec at 68'C. ; 



The three PGR fragments were made and joined in later JOINING-PCR reactions The three PGR 

uZ« ^ e sin9 ' e sharp bands and no 9e! purification was — y- cZXcr 

punfication was performed prior to the following JOINING-PCR 
JOINING of fragment 1 * 3 (same procedure for fragment 2 + 3)- 
15 5plBuffer2 

8pldNTP's(1.25mMeach) 
5.0 pi Fragment 3 
5.0 pi Fragment 1 
9.25 p| water 



30 



1 cycle of 120 sec at 94'C. 
Break. Add Enzyme 

10 cycles of 15 sec at 94'C. 60 sec at 60'C, 240 sec at 72'C 
Break. Add Primers 

15 cycles of 15 sec at 94'C. 60 sec at 60-0,(180 sec at 72'C add 20 sec pr cycle) 
1 cycle 600 sec at 68'C. ' 
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reaction (Expand- ,ono s ^ m 1,11 ^ ^ Were **- " a «ast PGR 



reaction (Expand- long system, Roche): 
5 pi Buffer 1 

14uldNTP's(i.25mM each) 
5.0 pi Fragment 1+3 
5.0 ul Fragment 2+3 
17.75 pi water 



After the first cycle at 94°r for 1 oft *u 

To* volume ,s now 4 5 0 Ml *"* * 8 ^ «*- 075 -* * *oded. 

t cycle of 120 sec at 94*C. 
Break. Add Enzyme 

B rea ,j o pr;r ,,5sMa,94 ' c ' 6oseca,6 °- c '^- a '^ 

The size oflho joined PCRfragment is 6.8 kb This PCRf™ . 

Oregen~ PCR purification kit m c ■ . - . fragment was purged using a 

B.suMs s,aln Aftert nsfll " f ** 50 M ' *» — *> -—ban a sta „dard 

- -«™an~rr^ 

las. JOINING PCR amplification ylJZ pcp , * USin9 "* *» 

oe,e«on had no, occunL "ulCe ^ "** ^ *" "» " » •*««■ » 

•he Mandni . cassty I^ZL ^ "**"' WM ~ «* 

raken togefher * the a^^^^T - ' ^ ™ S 

30 selected and denoted MB10S3 " ^ ° m " re4 °" e su <* <*>"« was 

Insertion of RPM f exore^inn 

— -^Bresston cassette adjacent to thn rr ^ „, um j !lMB3 ^ 

The ligation mix of the diqested PCR -, mr ,, ifi ^ ♦ . . ' 
end ,he Kpnl-Sal digested Ped-SoTpctT T ^ BW expre **" — 

» -g ,ha PCRpnmers «" — " ^ » • «* 

0179541 and #179542. Th,s resulted in a PCR fragment of approx. 9 Kb. 
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which was used to transform B.subtilis PL1801 fr****^ c „ 

encodes alpha-acatotactote decease a 0^7/ B C '° n ' n9 

431* x x ooxyiase, an exoenzyme from Bacillus brevis. J. Bacterid 172 

restreaked on Specbnomycn agar plates and analysed for the integration of the PCR 
fragment * P CR using the primers *1 79541 (SEQ ID NO: , 3) an(J #1 

into the hos^T " V * ^ ^ ^ ^ PCR haS ^17 

Z^T^rZ' s TT of 1,1656 dones were sequenced to «*" 

^ P ession cassette, one such clone was selected and denoted PL3598-37. 
Example 4 

An Intend £0 °" P ^ Sm " W> ° me «— 'or a library may be ccns.uc.ad ,„ vivo 

I) The DMA sequence encoding the Pre-Pra-domains of the auMWn please 
common,, Wn as Savlnase, preceded by and operabiy Ifcked to 

case a^t C^T 3 ^ ^ ^ * ~' 

» B) a marker Bene (a chloramphenicol resistance gene), and 

5' pcynucleocdet 0 "* T ^ " ? "» 3 ' homologous 

5 po^nudeobderegnn upstream of the polynucleotide lyfmD-ytmC-yfmB-yfmA-Pel startl L 

5 and PCR ITSTr ^ ^ S,6PS dl 9~«°" =>f pUC,9 plasmid 

and PCR fragments «ft appropriate restffcffon endonuclease sites of several different PCR 
fragments ,„ tne generally used ^ ^ ^ ™ 

Piasmrd. the «ga fi on mixture was transformed into electrocompeten, DH5a,phal~ "a, 

ul r ^ * * nal ^ — «-*»«• »y aaguenclng 

to con.a,n the correct ^struct as outlined above, and it was denoted PMB1508. 

The PMB1508 plasmid thus contains the following- 
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iii) The chloramphenicol resistance operon; 

iv) The 3' downstream flanking region [Pel-end-ynS-dtS(start)] which is 99-1000/ 
» homologous to the region of the B.subtili s . 

37254- v ^ 6,ementS HSted ^ the PUC19 vector (Stated from E co„ ATCC 

37254, V.e,ra J, Messing J. The P UC p,asmids, an Ml3mp7-derived system for inse^ 
mutagenes. and sequencing with synthetic universal primers Gene 19- 259 268 1^ • >k 
EcoRl and Sail sites to give pMBlSOS in ™wr rt .k ' 982 ) n the 

a specified site of th B su b TZZ a nt reSU ' tin9 ^ t0 aflW - n * to 

derivative of Bacifius suCl68 BG^C a ^ ~ ** * * 

.) The tnple promoter and the mRNA stabilising element 

Thus, when using MB1510 competent cells, it ts possible forth* oMBISoa t™- 
denies <he re o„ to di re cu y integrate into the genome o, MB1510 ^re he Ca«no 

=£5~~===== 

tk H'umoier, me mRNA stabilising e ement and thp prq 

™ " hi9h tt 0 " o,,he in,esra,ed 9sto - ,he ~ ~ £T 

present on the genome of MB1510. 

was "CR amplified using the two PCR primers: 

Primer #317 (SEQ , D NO: 19) tggcgcaatcggtaccatgggg 

Primer #139 Notl (SEQ ID NO- ?m » 

1 U ° NO - 20) ^Scatgcggccgcattaacgcgttgccgcttctgcg 
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with K , ^ r" 1 "" 9 "** k " °' *" SaV '" aSe ' ra9ment and ,he » MB1 508 <*-« - digested 

2 1 ' ^ *" reSUl "" B ' ra8men,S " — * — - ^phoresis. 

The .wo fragments are ligated, and .he „ga«on mixture is used ,o transfom, competent £. co/t cells 

wbich are .hen plated on Lftagar plates or placed in liquid media for growth overnight at 37X- 

bom rypea o, rnedia containing SO-IOOpgAn, 0 . Amp,o,,,in. After inoubation, a plasmid prep is made 

(usng 100-1 0.000 ng of ptesrnid per Information. The transfonned cells are plated onto TY 

^ 2 r Sk,mm " k ^ 6 VB "" «""— ' After ovemigh. incubation 

a. 37 C deanng zones appear around .hose colonies wherein the integration cassette is 
integrated properly into the cells, indicating high Savinase expression. 

This approach can also be used to make highly diverse libraries of any gene of interest 
expresses m fl.su**. where rather lhan a gene encoding one enzyme, any expressable 
paynucleotide Is inserted Wo the plasmid pMB1508 and Integrated into the MB1510 stein for 
subsequent screening. 



Sequence of plasm?*. p MB15Qfl fSFQ id mq- 91) 

The plasmid pMB1508 has the following components, indicated by basepair positions- 

BP 5186-395: pUC19 sequence from E.coU clone ATCC 37254, Vieira J, Messing J. The 
pUC p.asm.ds, an M13mp7-derived system for insertion mutagenesis and sequencing with 
synthetic universal primers. Gene 1 9: 259-268, 1 982. 

BP 396-1021: EcoR I cloning site (BP396-401) and the Cry.HA mRNA stabilising 
element. {Described in WO 9634963-A1 ) 

tP„ P Z r 2 ",! 41 * EnCOdeS "* P ' e - P '° SeqUen ° e 0,S « i ™» =«d the Aft* cloning site. 

TrT T„ W ° 9623 ° 73 ^ 1 ' *» "* * and "» "»*■ <-«— *• Pre- 

Pro and Afofl was introduced by the PCR primer. 

BP 1413-2512: The Bgl II cloning site (BP1413-1418) and the Chloramphenicol acetyl- 
transferase operon of pDN1050 (Described in eg. Diderichsen.B, Poulsen.G.B, Joergensen.S T • 
A useful cloning vector for Bacillus subtilis. Plasmid 30:312 (1993)). 

. T 13 ~ 5185: ^ PO,ynuc,eotide re 9 ion rPel-end-yflS-citS(start)] downstream of the 

peB locus of the B.subWs genome, (as it appeaars from the publication and corresponding 
debase of: F. Kunst, N. Ogasawara, .. Moszer, <146 other authors>, H. Yoshikawa, A. Danchin. 
The complete genome sequence of the Gram-positive bacterium Bacillus subtilis" 
Nature (1997) 390:249-256). 
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MB1510 has the following specific features in and around the pelB focus- 

sZZl Pr ° m0ter ^ ^ ^ Stabifeing 6tement ~— * « 0*— binding 

S gquspce at MR1S10 i^^ on rgninn f gB „ ,„ ^ ^ 

'0 yfmA-Pe, L 1 '° ° f B "*« «*» 9enon,e yfmD-^C-yfmB- 

^e Z„,T ^ PUb '" :a,i0n and «»-P"*0 database of: F Kuns, at a, 

Yoshikawa A dJT^ °S^wara. I. Moszer. <146 Char authors*. H. 

20 

Example 5. 

in fr oducad^,r r r' a ,''" Va,iant ""^ — ^ *» adds wara 

25 AlwMKiORs as template: Pnrners in a PCR reaction using genomic DNA from 

1605 (SEQ ID NO: 23): gacggccagtgaattcgataaaagtgc 

1606 (SEQ ID NO: 24): ccagatctctatnktnktgtacggagtctaactccccaa^ 

30 wherein N = A, C, G or T; and K = T or G. 

PMB1508. Hereafter following the principle described above. 
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containing 0.05 XTY **J^ZT£Z ' "* *** ^ 

incubated at 26'C for 72h jZ T IT chtoram <'"«> fc °< <«W»e»). The MTP S ^ 
6 c for 72n. After rncubaton eaoh „e„ was analyaed for prateolytic acMly. 



10 



AATaSI 


jNo. of transformants 


IL ft 


TT E 


QL 


3 


TP 




LP 


13 


Tl 




IQ 




QP 




PI 




LT i 


1 


TQ 


i 


IT ] 


i 


QQ t 


i 


PQ 1 




Total p 


to 



Possibilities position 1 
K 
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T ~"~ 


14. 


T ~ ' 


6~~ 


1 


3 


1 


* 


Q 

P ~~ ' ' 


6 


Q 


5 


— 


3 ~~~~ 


P 


8 




4 


L 


7 


Total 


30 


Total 


30 



^-vccns^™™^^ ^ ' phmer used for the 

»e«^. ^ find,ngS bV »» - *• Wty colonies isolated from 



Example 6 

Gonstruction of flerife ,„,,„„.. „„,.. u L , W)n , , r >4 

» dessonv* suoTp CT» ProJOrm ° f ^ S2A •» 

Becfc cfeusf, available ton, m„ »»*nown commercial protease derived from 

suWfc Sev- J! VOZ>meS ' Denma^,t, • "» "M* ** - denoted Bac*s 

1475. b shown ,n SEQ .D NO- 25 TneTrL M ' amP "" ed W " h ^ 1423 a "" 
•he L2 pmtease is shown in lD N0 T 9 ""^ ^ ^ ^ 

147MSEO 2f "^^^ctccggccccc^ccc^ 
1475 (SEQ ID NO: 27): 99agcggattgaacatgcgattaggtccggatcctgacaccccag 

■ constructed"^ anb ° f ** - ™ vahan, sav.L2 HVOwas 

conslrucled to have 8 em,no adds extra in the C-terminus: QSHVQSAP (SEQ ID no- <n h 
the DNA sequence extension in..- • , ^n'UMf (SeQ ID NO: 3), by using 

NO: 4. Tail LanTst U hv " ^ ™* * P ° 0 * n " M * * *> « ID 

» QSAP (SEqId NO 5 ^ T COnS ' a,Cted 10 ^ 4 ami "° ^ h "» C — 
stooj k u 9 " ° NA SeqUenCe eXtem,on 1" front of the TAA 

stopoodon wh,ch is shown in SEQ ID NO: 6. Both ta» variants had the SAvj^SE™ all 
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above. 11 r MB1053 toS ' ^ » " 

colon.es ws, XI ^ 6 ,,9 " , " *™ P-e=se posKve 

sequence, end one s."„ t eel T * " "* '° C ° n « !rm ' he «*• ™» 

Example 7. 

»uoi me six EDenmeyerflesks were incubated a! S7-r-«sn_ . . 
um). and Ihe last two at 26'C (250 m m > a , ' tV "° at 30 ' C < 250 

and 3 and analyzed fa l^Jotl T — **"*-" «acn shake task at day 1 . 2 

increase of up to 40% is ob^ • ^ ex P ressed * suW/Yfe. An 

tenants L, ^CTtae " °— > ~ <* - 



Table 6. Relative proteolytic activities at37»c. 



Sav-L2 



_Sav-L2 HV1 



Sav-L2 HVO 



Day 1 



\0 



1.4 



1.3 



Day 2 



1.0 



1.3 



ii 



Day 3 



1.2 



Table 7. Relative proteolytic activities at 30'c. 



Sav-L2 



Sav-L2 HV1 



Sav-L2 HVO 



Dayj 



1.0 



1A 



Pay 2 



io 



1.2 



1*. 



Day 3 



1.4 



Table 8. Relative proteolytic activities at 26°C. 



Sav-L2 
Sav-L2 HV1 
Sav-L2 HVO 

Example 8 



Day 1 

JL2 

1,3 
0.2 



Day 2 

JL0 

_u 



Day 3 

M 

1.1 

1-1.1 
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an.no a., sequence ,'s JT h «^ .THl "IT * "* 

•he W U " ' D N ° : 6 fr0 "' * - ™A aopcodon of 

U210R HV,, ^ aen0,6d 8 S< "" ,fe L210R - *"*» HVO. and S.a ui * 

Example 9 

Shake flask al day 1 2 and 3 anH « M i V Samf> ' e " teken from «* 

1. end fa teHes 9 , ; ' ! 7 " P,0te °' y,iC •*** ^ " — " 

proragion from "oR^l *" *•* ° ,,he <* ■» 

Table 9. Relative proteolytic activities at 37«c. 



10423.010-DK 



10 



15 



20 



57 



_10RSHV1 
L210R 
L210R HVO~ 
L210R HV1 



1± 



5,3 



9,1 



8,5 
2,3 



14,4 

.20,9 



4,3 



1A. 



Table 10. Relative proteolytic activities at 30«c 




Table 11. Relative proteolytic activities at 26°C. 



10RS 
10RSHV0 



10RSHV1 



L210R 



L210R HVO 
L210R HV1 



2.6 



3.7 



0,4 



2.3 
2.2 



Day 2 



1.0 



3,3 



0,7 



2J. 



Day 3 



2.8 



3.1 



Example 10 

(a) nocardiopsis dassonvitlei NRRL uht» « ^ ... ^ 

... ^ w . " w/w NKKL 15133 as descnbed in WO 88/03947- 

sequences of the protease derived from Nocordiopsis so JMRRl. KWfio 

Paten, app»oa»Ori „o. 1 996 000,3, and WO 0,,5«™ del h 1 ' n DK 

aoid-stable proteases relets ,„ .„ . USe anlmal feed ° f 

Id, « ^ 35>: ^^'^'^cagggtoagaoo. 

,d, :rr~eri^rs=^^ 

:38):9 "-^-'^ooaocggaoogotocoooagto 
1602 (SEQ ,D NO: 39): gcggatcotenaggtocggagaoggacgoocoaggag 
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T^TZ 1S649; *• amin ° - "» * SEQ ,D NO' 

41 . the encod,n B nucleotide sequence is SEQ ID NO: 40: .he gene b iso,e,ed from »» ' 
genom,c ONA of M strain by PCR-amptificaBon using the two primes- 
«m(aea ID NO: 42): gttcetoge.cgcef^gccecoggacceccccccag.c. end ,602 (SEQ ,D 
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Example 11 

™ e 0 e *™^of,heNoca«^ 
rotease essayed in e monogasmc fn dfces«o„ mode,. The peHbrmanoe of a puld 
P epareuon of me nraUrre par, o, me protease heving SEQ ,D NO: 28 (prepered as dCed 

Te^r ,eS,ed '° r "* ^ 10 lmprove «M— . of mate/- 

mcubeted w„h HCI/peps,n - srmulaUng gastric digesSon - and subsequently wilh panoreatln 

ir: 9 in, T' di9e * n - 10 * ,ha nasks ™ a ™<> - - — h ; 

inouZ h - remaWn9 88 b ' ankS - A * «» « ° f »° 



Components added 




5mlHCI (0.105M)/ pepsin 
(3000 U/g substrate), 1 rnL 
protease of the invention 



16 mJ H 2 0 



7 ml NaOH (0.39M) 



5 ml NaHC0 3 {1M)/ 
pancreatin (8 mg/g diet) 
Terminate incubation 



3.0 



6.8 



6.8 



40 a C 



40°C 



40°C 



t= 1 .0 hour 



t=1.5 hours 



t=2.0 hours 



t=6.0 hours 



Gastric digestion 



Gastric digestion 



Intestinal digestion 



Intestinal digestion 
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Conditions 

Substrate: 4 g SBM, 6 g maize (premixed) 

PH: 3.0 stomach step/ 6.8-7.0 intestinal step 

HCI: 0.105 M for 1.5 hours (i.e. 30 min HCI-substrate premising) 

5 pepsin: 3000 U /g diet for 1 hour 

pancreatin: 8 mg/g diet for 4 hours 

temperature: 40°C. 

. Replicates: 5 

10 Solutions 

0.39 M NaOH 
0.105 M HCI 

0.1 05 M HCI containing 6000 U pepsin per 5 ml 
1 M NaHC0 3 containing 16 mg pancreatin per ml 
15 125 mM NaAc-buffer, pH 6.0 

Enzyme pr otein determinations 

The amount of protease enzyme protein (in what follows, Enzyme Protein is abbreviated 
EP) ,s calculated on the basis of the /^values and the amino acid sequences (amino acid 

r 8 ~ 2 t prindples ouuined jn scGi " & ph - von Hippei> Ana,yticai Biochemi ^ 

Experimental pi™ e dure fnr in wjtro modal 



35 



1 2 , Th ; ^ 7 h er,rT1enta, PrOCedure was acco *ing ^ the above outline. pH was measured at time 
. 2.5, and 5.5 hours. Incubations were terminated after 6 hours and samples of 30 ml were 
removed and P ,aced on ice before centrifugal (10000 x g, 10 min, 4'C). Supematants were 
removed and stored at -20°C" 

Analysis 

" Sam P les were ^alysed for % degree of protein with the OPA method as well as content 
of solub,l.sed and digested protein using gel filtration. 

PH determination by th» n PA . mftthriH 

The Degree of Hydrolysis (DH) of protein in different samples was determined using an 
sem,-automated microtiter plate based colorimetnc method (Nie.sen.P.M.; Petersen.D, 



10423.010-DK 



60 



20 



25 



30 



35 



Z?Z£!ZTT ot ,or determ ' n ' n9 ^*»-o. MroVsis . J . FoodSci 

2001 66. 642-646). The CPA reagent was prepared aa followa: 7.620 g di-Na letrabo^.. 

ZZV and 200 m9 (SDs) — — 

about o?Z£ " ^ * **« "* "W* «° - P«0 are) of 

uu. ., owitzenand). All other spectrophotometer readinqs were Derfnrm^H »t 

tfOPA reagent was automatical dtspensed. Ptates were shaken ,2 min; 700 rpm) beL 
"r *"* - ° H ~ ~ ^^„a„o„ P o,rr pl ea„as 

Estimation of soli ihllteoH dinestgrt pr ^., n 

«.,■ , T ^° 0ntem ° f SOh,bi " sed ««*"> supematants from in titro digested sampias was 

ffltered mreugt, 0.45 pm polycarbonate flitere and diiuted (1:50, v/v, with H,C, DButed ^7 
were ohromatographed by HPLC using a Superde* Pepnde PE (7.5 x 300 mm) get «re2 
column « Tbe eluen, used for iaoere^on was 50 l^ZZ^^ 

0 4 mTnT, 150 T* ^ "° IUme « - ™ - » « and tne o^L 
del™ „ f reCOrded 31 214 " m « *• «~ area under tbe profiles was 

deemed by mtegreaon. To estate p^n oonten. from integrated areas, a cir c le 
"°" 93) a 13 " 6 from a dilution series of an In vto, digested reference maize, SBM ~ , 
w»h know. ,ota, protein conlent. Tbe protein delem^nata lis refeZ sam7was JrT 

Analysis 14th ad., Washington DC). 

The content of digested protein was estimated by Integrating the chrematogram area 
Responding ,„ peptides and * adds having a molecular mass of ,500 Daiton" Z. 

^ i^T SF ' ^ F ° rT,K ^ "™en, Of Pretefc Diga^ " 
FoodSa. ,066 51. 404-406; Babinsaky.L, Van.O.M.u.M, Boer.H, Den,H.L.A An In-vitre Method 
for Predicbon ofThe Digest Crede Protein Content In Pig Feeds. J. Set. Food Agr. logo. 50 



10423.010-DK 



61 

173-178: BM i, Eggum.B.O. CrKica, Evaluation of ln-vit ro Methods for Estimating Digastfbiiily 

NUW,,on RMeart * ~ ,991 ' 4 ' l4i - ie2 >- T ° ~t 

" I T . 9S ' fil,raSOn C °' Umn — °* ibrated ^ <*°*"™ C (Soehnnger 

fla8Wn '• and subs,ance p (S,9ma usa) ' as «- 



10 



15 



Results 

The results shown in Tables 13 and 14 below indicate that the protease increased the 
Degree of Hydrolysis (DH), as we,, as so,ub,e and digestive protein significant 

Table 13: Degree of Hydrolysis (D^ bjolyte »nH relative values 



Enzyme 
(dosage in mg EP/kg 
feed) 



Blank 



Protease of the invention 
(100) 



Of total protein 



%DH 



26.84 



28.21 



SD 



0.69 



0.35 



Relative to blank 



%DH 



100.0 



105.1 



%CV 



2.57 



Different letters wUh.n the same column indicate signal dances (I^NOvi Tukey- 
Khjmjr test, P<0.05, SO . Standard Deviation. o /oCV - Coe.oient of VadanL = (Sol^ Le) 

T able 14: Soluhilised and diq astort crude protein m qasured hv Akta mdi . 



Enzyme 
(dosage . in 
mg EP/kg 
feed) 
Blank 



Protease of 
the invention 
(50) 



(100) 



Of total protein 



%dig. 
CP 

54J 



57.7 



58.9 



SO 



1.1 



0.8 



%sol.CP 

90T 



93.2 



94.8 



SD 

TT 



1.4 



0.9 



Relative to blank 



%dig.CP 

100.0 ' 8 



106.7 



108.9 



CY% 

2Tb" 



1.9 



1.3 



%sol.CP 

100.0 



103.4 



5BS55 letters within the 'same column indicate' s,gniflca"nt diitere'nces (I^^X Tuk ly- 
K,mer test, P<0.05, SD = Standard Deviation. o /oCV . Coefficient of Va.anoe = (SDWnlue) 



cv% 

T2 



1.5 
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10 



15 



20 



Example 12 

-naming 5 ^ ^11 £T " ^ """"^ Whereas 

^«rr::;::r::r:r^- 



Outline of aq„ a ft, ,„ Y m digestion procedure 



Components added 




Conditions 
Substrate: 
pH: 
HCI: 
Pepsin: 
Pancreatin: 
Temperature: 15°C 
Replicates: 5 



10 g extruded SBM 

3.0 stomach step/ 6.8-7.0 intestinal step 
0.155 M for 6 hours 
4000 U/g diet for 6 hours 
8 mg/g diet for 17 hours 



Solutions 
1.1 M NaOH 
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0.155 M HCJ / pepsin (4000 U/g diet) 
1 M NaHC0 3 containing 16 mg pancreatin/mL 
125 mM NaAc-buffer, pH 6.0 

E xperimental nrocednrA for gg ya In vitm mode] 
Analysis 

HP:c~ *• ° PA — » -gree of hydrolysis) and by Akta 

erm,ne solub.l.sed and d.gested protein (see monogastric example). 

Pre-treatment of in vitro supematants with pacv o PF rn ,„ m| | 
Results 

The results shown in Tables 15 and 16 below indicate that tho „ ♦ 
increased Degree ofhydrolysis and protein digestibiHty. Pr ° teaSe 
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Different letters within the same co.umn indicate significant differences (1-way ANOVA Tukey- 
Kroner test. P<0.05). SO = Standard Deviation. % CV - CoefHcient of VadanL . (SO/rnean Taiue) 



labteJSLSolubilised and digested crude protein 
values 



in measured by AKTA HPLC, absolute and relative 



Enzyme 
(mg EP/kg 
diet) 



Blank 



N 



Of total protein 
%CP 



dig 



50.0 



SD 



2.2 



%CP 
sol 



89.9 



SD 



3.2 



Relative to blank 



%CP 
dig 



100.0 



%CV 



4.5 



%CP 
sol 



100.0 



%CV 



3.5 



Protease of 
the , 
invention 
(50) 



91.4 



1.5 



104.8 



2.1 



101.7 



(100) 



53.4 



0.4 



91.6 



1.0 



107.0 



0.7 



■ ■ — — — 1 I II | j ■ w.-w U. f 101.9 ** i -| 

Differe.nl iette^ w ,u„„ , he ^ ^ U xtl^Lrtt diffUnUs (1-Ly AN<j V A W 
^™ P< ° ° 5) - SD ' — « °— . Coemcen, of VaJe . (SD/^fu.) 



1.6 
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CLAIMS 



33. SEQ ID NO: 37: SEQ ID NO: 41; or SEQ ID NO- 43- 

(b> Z^oTrT 0 " sequenoe b * l9ast 70% ldenfai * •» -*» «* 

^0 ™S , »T*, , t^" h ' SnCOded by 0,0 P*»— • ^ SEQ 

r^Q? D ;°o 2 40 SEQ ,D ' D ** « » - 3, SEQ ID 

(0 eompdsas a mature par, which is a varfan, of «ha m a,ura par, of ft. polypapada having 

(d) is an allelic variant of (a), (b), or (c); or 

(e) is a fragment of (a), (b), (c), or (d). 

9 more anan^aadfs) addad to tha Camrlnus aa compared ,o <ha wadtype. 

30 



35 



6. The polypeptide according to any of claim* 1 «; ,»k^k ~ 

from a dHfarsn, please- pm,a rab "v L oT J * •«»««*» 

moa, prafambly , la « ^^^7" " fr0m " °' S1E and 

30. SEQ ,D NO: 33. SEQ I NO " B ZZ^T" " '° ^ 2 °' SEQ ° N <> 

iu inu. j/, oEQ ID NO: 41 , or in SEQ ID NO: 43. 
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7. The polypeptide according to any of clalm«5 1 fi«,hi„v, 

a ««iy or aaims i - 6 which comprises a heteroloaous spri^ti™ 

the heterologous secrabons,gna, peptide Is derived fan, a heteralogous pratease. 



8. The polypeptide according to claim 7 wh*»r*»in »h Q • 

„ ' wnere 'n the heterologous secretion signal Deotide 

zzt an "*° a " a se,,KnM ha ^ = -~ - - — ™ur*,„ 0 

acd sequence encoded by polynucleotides 1 - 81 of SEQ ID NO: 2. 
10 9. An ,so,a,ed polynucleotide encoding a polypepfide as defined in any of Caims 1-8, 

express,on uector w ****** ~ — a «. - 

<5 11. A tecombinan. host ce„ comprising a polynucleotide as defined in Cain, 9, or an expression 
vector or polynucleotide construct as defined in daim 10. express™ 

12. The recombinant host cell according to claim 1 1 which is a Bacffius cell. 

20 13. A transgenic plant, or plant part, comprising e polynuCeolide as denned in claim 9 or an 
exprees.onvactor^polynudeoweconstructasdefinedinc.aimio. 

14. A transgenic, non-buman animat. or praducts. or dements thereof, comprising a 

15 m Arr K ^, orp ^ 9a ^ ptideasde ^ |nanyo(c|aims ^ 
Compaq (a a recomblnan( hos( m< ^ da( . ned ^ ^ 

30 Lt T mal r deBnedi ' ,da,mSl30r,4 ' ,0 ^««— n, comprising me 
30 polypeptide, and optionally (b) recovering the polypeptide. 

m An anima, feed addifive comprising a, iaas. one poiypeptide as defined in any o, Cairns 1 - 8; 

(a) at least one fat-soluble vitamin, and/or 
35 (b) at least one water-soluble vitamin, and/or 
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(c) at least one trace mineral 



any oi aaims 1 - 8, or at least one feed additive of claim 16. 
Phospholipase A, (EC 3.1,1.32,; phospLpJ ™ 3 7! k 

phosphatase C (3.1.4.3); phoiho.pL ^cfl 4!, "L * *"*»*WP«- <» 3.1.1.5); 
EC 3.2.1.6) Pnonpase D (EC 3.1.4.4); and/or beta-glucanase (EC 3.2.1.4 or 



^:^zTZT,r msm9 inciudin9 me poi ~ ) - — ■ ~ - 
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ABSTRACT 



5 



Modtaget 

srfl«'™- . ,2BEC ' 2M3 



<U0> Lassen, soren Flensted PVS 
<120> Improved protcases md 

<130> 10423. 010-DK 
<160> 43 

<170> Patentm version 3.2 

<210> 1 
<2ll> 1062 
<212> DNA 

<213> Nocardiopsis sp. nrrl 18262 
<220> 

'"isc.feature 
<222> (l) . . ( 4 95) 

223> Encodes the pro-region shown in positions -165 to -1 of SEQ lo 
<220> 

< 221> misc_feature 
<"2> (496)..a0sl) 

<223> Encodes the .nature region shown in positions 1-188 of seq id 
<400> 1 



9Cta t ggag cattacctca ^^^^^ ^^^^^ ^ 

acaac gtgatcttga tcttacatca gctgaagctg aggaattact tgctgcacaa 
a a g Cct ttgMgttga ^^^^ ^^^^^ 

tcagtattcg atactgaatc actcgaactt actgtactag tgaccgatgc agcagct tt 
9 aagctgttg aagccacagg tgcaggtaca gagctcgtat cttatggtat tgatgga 
9 atgagatc g tacaagagct taatgcagct gatgccgttc caggtgtagt t g 
c 9 atgtag caggtgatac tgttgtctta gaagttcttg aaggctctgg 

1 :i: cttt ta9ca9ac9c a " agtc9at 9catccgc " - 9 La 

cagcctgaac tctatgccga tatcattgga ggcctagcgt acacaatggg t ggt C gctgc 
a 9 c 9 taggat ttgcagccac aaatgcagct ggacaacctg gcttcgtlc a 
9 ^ccgcg tcggtacaca ggttactatc ggcaatggaa gaggtgtct t tgag aagc 

9 ~ 9C — cgt ccactttac gcttactaac 

ttag tatcta gatacaacac tggcggatat gcaactgtag ^ 

la "I 9C9tCt9CC9 CtCa " 9tC9 aCt ~ aaccattcaa 
17Z 939C9t9a9 " atCCt9aa " taCC ^ aa cgaacatgac.tcgtacgact 

iizz: a : cca99t9a ctct " a99t tcatatatca ~* 

a a 9 " atCC " taaCt9ta " — "«« -acgttcta 

acaccgatgg tgaactcttg gggagttaga ctccgtacat aa 
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60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1062 



<210> 2 10423. 010-DK.ST25.txt 

<211> 1143 

<212> DNA 

<213> Artificial sequence 



<220> 
<223> 



-> 



<400> 2 



at g aa 9aaac c 9t t 9g9gaa a,» g tc g « a g cacc g „c tactca«« tgtt 9 «t« 

ITT tC9C " C " C t9Cta "" a 9C3tta " tC -« 9 aa 9 ca 

"*** C9 " 9CM9a *~ "Vacate a 9 ct 9 aa 9C r 
a 99 aa«ac ttgctgcaca agatacagcc tttgaagttg * 

****** catatggtgg tt „ gmtc 9atactgaM Mctc9aact J^-JJ 

~? Ca9Ca9 " 9t t9aa9 " 9tt 9a,9CCaCa9 «« -»««•» 

««at 99 ta ttgatggatt agatgagatc gtacaagagc ttaatgcagc * 

cca 9gI ttggatggta tKtgatgta gcaggtgata ^ * 

g rr 9 9asct9at9t ttct " ac " tta9casac9 — -«» 

gttgaagtga ccac g tca ga tcagcctgaa ctctatgccg „ °> 

tggacaac 9 c 

Mcttcm, ca g ct gga ca tt g c gg cc g c g « 9g tacac aggttactat cggcaatgga 
9a ggtgtct ttgagcaaag cgtatttccc gggaatgatg n J 

aac« t a c gctt ac ta a ct^atct a.atacaaca c tgg c ggata tgt 
c, at? atcaagcacc taMggctct agcgtctgcc ^ 

a l l 7 CCa " Ca a9C " 9a99t " 9a9C9t9a 9 " a "« 9a 

alota 9UC9aC t9tat9t9Ca 9aaCCa » t0 °«"*>™ 

ac™ M9C9Caa " C9tta " tCa " tM ""' — «** «W*e 
ac.ac g «ct acca gg aa g t 9 acacc g at g gtgaactctt ggggagttag , ca 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1143 



<210> 3 

<211> 8 

<212> prt 

<213> Artificial sequence 
<220> 

<223> c-terminal amino acid tail pynwe^j * . 

invention. Tai 1 ex Pressed as fusion to protease of the 

<400> 3 

Gin ser His val |ln ser Ala Pro . 



<210> 4 
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<211> 24 0423 * 01 °-DK . ST2 5 . txt 

<212> DNA 

<213> Artificial sequence 
<220> 

<400> 4 

caatcgcatg ttcaatccgc tcca 

24 

<210> 5 
<211> 4 
<212> PRT 
<213> Artificial 

sequence 

<220> 

<223> l2SS2? amin ° acid tail ^pressed as fusion to protease of the 

<400> 5 

Gin ser Ala Pro 

<210> 6 

<211> 12 

<212> DNA 

<213> Artificial sequence 
<220> 

<2 " > SSCj^pftJS 0 !^ fiJSag] -no acid tail expressed as 

<400> 6 
caatcggctc ct 

12 

<210> 7 
<211> 2 
<212> prt 

<213> Artificial sequence 
<220> 

<2 " > liSESiff amin ° aGid tail pressed as fusion to protease of the •• 
<400> 7 
Gin Pro 

<210> 8 

<211> 6 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> %lX^l%USl°in*l SSffl£ «" «« -P-sed as 

<400> 8 
caacca 
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<210> 9 1042 3 - 010-OK.ST25.txt 

<211> 1 

<212> prt 

<213> Artificial sequence 
<220> 

<223> fSSSS? ami ' n0 acid ta11 e -P^ssed as fusion to protease of the 

<400> 9 

Pro 
1 

<210> io 
<211> 3 
<212> ONA 

<213> Artificial sequence 
<220> 

<223> SSS'JPEiUK'S'W fttS-jgl a»H„o acid tai, expressed as 

<400> 10 
cca 

3 

<210> 11 
<2ll> 45 
<212> dna 

<213> Artificial sequence 
<220> 

<223> Primer #252639 
<400> 11 

catgtgcatg tgggtaccgc aacgttcgca gatgctgctg aagag 45 

<210> 12 

<211> 44 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Primer #251992 

<400> 12 

catgtgcatg tggtcgaccg attatggagc ggattgaaca tgcg 44 

<210> 13 

<211> 44 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Primer #179541 

<400> 13 

gcgttgagac gcgcggccgc gagcgccgtt tggctgaatg atac 44 

<210> 14 
<211> 43 
<212> DNA 

<213> Artificial sequence 
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<220> 

<223> Primer #179542 
<400> 14 

gcgttgagac agctcgagca gggaaaaatg gaaccgcttt ttc 

<210> 15 

<211> 64 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> primer #179539 

<400> 15 

cc«ttgatc agaattcact ggccgtcgtt ttacaaccat tgcggaaaat agtcataggc 

<210> 16 

<211> 60 

<212> DMA 

<213> Artificial sequence 
<220> 

<223> Primer #179540 

<400> 16 

ggatccaga, ctggtacccg ggtctagagt cgacgcggcg gttcgcgtcc ggacagcaca 

<210> 17 

<211> 37 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Primer #179154 

<400> 17 

gttgtaaaac gacggccagt gaattctgat caaatgg 

<210> 18 

<211> 37 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> primer #179153 

<400> 18 

ccgcgtcgac actagacacg ggtacctgat ctagatc 

<210> 19 
<211> 22 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> Primer #317 
<400> 19 

tggcgcaatc ggtaccatgg gg 
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<210> 20 
<211> 40 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> Primer #139 NotI 
<400> 20 

catgtgcatg cggccgcatt aacgcgttgc cgcttctgcg 40 

<210> 21 
<211> 7443 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> Sequence of plasmid pMBl508 
<400> 21 

tcgcgcgttt cggtgatgac ggtgaaaacc tctgacacat gcagctcccg gagacggtca 60 

cagcttgtct gtaagcggat gccgggagca gacaagcccg tcagggcgcg tcagcgggtg 120 

ttggcgggtg tcggggctgg cttaactatg cggcatcaga gcagattgta ctgagagtgc 180 

accatatgcg gtgtgaaata ccgcacagat gcgtaaggag aaaataccgc atcaggcgcc 240 

attcgccatt caggctgcgc aactgttggg aagggcgatc ggtgcgggcc tcttcgctat 300 

tacgccagct ggcgaaaggg ggatgtgctg caaggcgatt aagttgggta acgccagggt 360 

tttcccagtc acgacgttgt aaaacgacgg ccagtgaatt cgataaaagt gctttttttg 420 

ttgcaattga agaattatta atgttaagct taattaaaga taatatcttt gaattgtaac 480 

gcccctcaaa agtaagaact acaaaaaaag aatacgttat atagaaatat gtttgaacct 540 

tcttcagatt acaaatatat tcggacggac tctacctcaa atgcttatct aactatagaa 600 

tgacatacaa gcacaacctt gaaaatttga aaatataact accaatgaac ttgttcatgt 660 

gaattatcgc tgtatttaat tttctcaatt caatatataa_tatgccaata cattgttaca 720 

agtagaaatt aagacaccct tgatagcctt actataccta acatgatgta gtattaaatg 780 

aatatgtaaa tatatttatg ataagaagcg acttatttat aatcattaca tatttttcta 840 

ttggaatgat taagattcca atagaatagt gtataaatta tttatcttga aaggagggat 900 

gcctaaaaac gaagaacatt aaaaacatat atttgcaccg tctaatggat ttatgaaaaa 960 

tcattttatc agtttgaaaa ttatgtatta tggagctctg aaaaaaagga gaggataaag 1020 

aatgaagaaa ccgttgggga aaattgtcgc aagcaccgca ctactcattt ctgttgcttt 1080 

tagttcatcg atcgcatcgg ctgctgaaga agcaaaagaa aaatatttaa ttggctttaa 1140 

tgagcaggaa gctgtcagtg agtttgtaga acaagtagag gcaaatgacg aggtcgccat 1200 

tctctctgag gaagaggaag tcgaaattga attgcttcat gaatttgaaa cgattcctgt 1260 

tttatccgtt gagttaagcc cagaagatgt ggacgcgctt gaactcgatc cagcgatttc 1320 

ttatattgaa gaggatgcag aagtaacgac aatggcgcaa tcggtaccat ggggtatatc 1380 
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1440 
1500 
1560 
1620 
1680 
1740 
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aacgcgttaa tccgcggata tatagcggcc gcagatctgg gaccaataat aatgactaga 
gaagaaagaa tgaagattgt tcatgaaatt aaggaacgaa tattggataa agtgggatat 
ttttaaaata tatatttatg ttacagtaat attgactttt aaaaaaggat tgattctaat 
gaagaaagca gacaagtaag cctcctaaat tcactttaga taaaaattta ggaggcatat 
caaatgaact ttaataaaat tgatttagac aattggaaga gaaaagagat atttaatcat 
tatttgaacc aacaaacgac ttttagtata accacagaaa ttgatattag tgttttatac 
cgaaacataa aacaagaagg atataaattt taccctgcat ttattttctt agtgacaagg 
gtgataaact caaatacagc ttttagaact ggttacaata gcgacggaga gttaggttat _ 
tgggataagt tagagccact ttatacaatt tttgatggtg tatctaaaac attctctggt 1920 
atttggactc ctgtaaagaa tgacttcaaa gagttttatg atttatacct ttctgatgta 1980 
gagaaatata atggttcggg gaaattgttt cccaaaacac ctatacctga aaatgctttt 2040 
tctctttcta ttattccatg gacttcattt actgggttta acttaaatat caataataat 2100 
agtaattacc ttctacccat tattacagca ggaaaattca ttaataaagg taattcaata 2160 
tatttaccgc tatctttaca ggtacatcat tctgtttgtg atggttatca tgcaggattg 2220 
tttatgaact ctattcagga attgtcagat aggcctaatg actggctttt ataatatgag 2280 
ataatgccga ctgtactttt tacagtcggt tttctaacga tacattaata ggtacgaaaa 
agcaactttt tttgcgctta aaaccagtca taccaataac ttaagggtaa ctagcctcgc 
cggaaagagc gaaaatgcct cacatttgtg ccacctaaaa aggagcgatt tacatatgag „ w 
ttatgcagtt tgtagaatgc aaaaagtgaa atcagctgga ctaaaagggg ccgcagagta 2520 
gaatggaaaa ggggatcgga aaacaagtat ataggaggag acctatttat ggcttcagaa 2580 
aaagacgcag gaaaacagtc agcagtaaag cttgttccat tgcttattac tgtcgctgtg 2640 
ggactaatca tctggtttat tcccgctccg tccggacttg aacctaaagc ttggcatttg 2700 
tttgcgattt ttgtcgcaac aattatcggc tttatctcca agcccttgcc aatgggtgca 2760 
attgcaattt ttgcattggc ggttactgca ctaactggaa cactatcaat tgaggataca 2820 
ttaagcggat tcgggaataa gaccatttgg cttatcgtta tcgcattctt tatttcccgg 2880 
ggatttatca aaaccggtct cggtgcgaga atttcgtatg tattcgttca gaaattcgga 2940 
aaaaaaaccc ttggactttc ttattcactg ctattcagtg atttaatact ttcacctgct 
attccaagta atacggcgcg tgcaggaggc attatatttc ctattatcag atcattatcc 
gaaacattcg. gatcaagccc ggcaaatgga acagagagaa aaatcggtgc attcttatta 
aaaaccggtt ttcaggggaa tctgatcaca tctgctatgt tcctgacagc gatggcggcg 
aacccgctga ttgccaagct ggcccatgat gtcgcagggg tggacttaac atggacaagc 
tgggcaattg ccgcgattgt accgggactt gtaagcttaa tcatcacgcc gcttgtgatt 
tacaaactgt atccgccgga aatcaaagaa acaccggatg cggcgaaaat cgcaacagaa 
aaactgaaag aaatgggacc gttcaaaaaa tcggagcttt ccatggttat cgtgtttctt 
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2340 
2400 
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3000 
3060 
3120 
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3240 
3300 
3360 
3420 
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ttggtgcttg tgctgtggat ttttggcggc agcttcaaca tcgacgctac cacaaccgca 3480 

ttgatcggtt tggccgttct cttattatca caagttctga cttgggatga tatcaagaaa 3540 

gaacagggcg cttgggatac gctcacttgg tttgcggcgc. ttgtcatgct cgccaacttc 3600 

ttgaatgaat taggcatggt gtcttggttc agtaatgcca tgaaatcatc cgtatcaggg 3660 

ttctcttgga ttgtggcatt catcatttta attgttgtgt attattactc tcactatttc 3720 

tttgcaagtg cgacagccca catcagtgcg atgtattcag catttttggc tgtcgtcgtg 3780 

Qcagcgggcg caccgccgct tttagcagcg ctgagcctcg cgttcatcag caacctgttc 3840 

gggtcaacga ctcactacgg ttctggagcg gctccggtct tcttcggagc aggctacatc 3900 

ccgcaaggca aatggtggtc catcggattt atcctgtcga ttgttcatat catcgtatgg 3960 

cttgtgatcg gcggattatg gtggaaagta ctaggaatat ggtagaaaga aaaaggcaga 4020 
cgcggtctgc ctttttttat tttcactcct tcgtaagaaa atggattttg aaaaatgaga 



4080 



aaattccctg tgaaaaatgg tatgatctag gtagaaagga cggctggtgc tgtggtgaaa 4140 



4200 



aagcggttcc atttttccct gcaaacaaaa ataatggggc tgattgcggc tctgctggtc 

tttgtcattg gtgtgctgac cattacgtta gccgttcagc atacacaggg agaacggaga 4260 

caggcagagc agctggcggt tcaaacggcg agaaccattt cctatatgcc gccggttaaa 4320 

gagctcattg agagaaaaga cggacatgcg gctcagacgc aagaggtcat tgaacaaatg 4380 

aaagaacaga ctggtgcgtt tgccatttat gttttgaacg aaaaaggaga cattcgcagc 4440 

gcctctggaa aaagcggatt aaagaaactg gagcgcagca gagaaatttt gtttggcggt 4500 

tcgcatgttt ctgaaacaaa agcggatgga cgaagagtga tcagagggag cgcgccgatt 4560 

ataaaagaac agaagggata cagccaagtg atcggcagcg tgtctgttga ttttctgcaa 4620 
acggagacag agcaaagcat caaaaagcat ttgagaaatt tgagtgtgat tgctgtgctt 



4680 



gtactgctgc tcggatttat tggcgccgcc gtgctggcga aaagcatcag aaaggatacg 4740 

ctcgggcttg aaccgcatga gatcgcggct ctatatcgtg agaggaacgc aatgcttttc 4800 

gcgattcgag aagggattat tgccaccaat cgtgaaggcg tcgtcaccat gatgaacgta 4860 

tcggcggccg agatgctgaa gctgcccgag cctgtgatcc atcttcctat agatgacgtc 4920 

atgccgggag cagggctgat gtctgtgctt gaaaaaggag aaatgctgcc gaaccaggaa 4980 

gtaagcgtca acgatcaagt gtttattatc aatacgaaag tgatgaatca aggcgggcag 5040 

scgtatggga ttgtcgtcag cttcagggag aaaacagagc tgaagaagct gatcgacaca 5100 

ttgacagagg ttcgcaaata ttcagaggat ctcagggcgc agactcatga attttcaaat 5160 

aagctttatg cgattttagg gctgcgtcga cctgcaggca tgcaagcttg gcgtaatcat 5220 

ggtcatagct gtttcctgtg tgaaattgtt atccgctcac aattccacac aacatacgag 5280 

ccggaagcat aaagtgtaaa gcctggggtg cctaatgagt gagctaactc acattaattg 5340 

C9ttgcgctc actgcccgct ttccagtcgg gaaacctgtc gtgccagctg cattaatgaa 5400 

tcggccaacg cgcggggaga ggcggtttgc gtattgggcg ctcttccgct tcctcgctca 5460 
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ctgactcgct gcgctcggtc gttcggctgc ggcgagcggt atcagctcac tcaaaggcgg 5520 

taatacggtt atccacagaa tcaggggata acgcaggaaa gaacatgtga gcaaaaggcc 5580 

agcaaaaggc caggaaccgt aaaaaggccg cgttgctggc gtttttccat aggctccgcc 5640 

cccctgacga gcatcacaaa aatcgacgct caagtcagag gtggcgaaac ccgacaggac 5700 

tataaagata ccaggcgttt ccccctggaa gctccctcgt gcgctctcct gttccgaccc 5760 

tgccgcttac cggatacctg tccgcctttc tcccttcggg aagcgtggcg ctttctcata 5820 

gctcacgctg taggtatctc agttcggtgt aggtcgttcg ctccaagctg ggctgtgtgc 5880 

acgaaccccc cgttcagccc gaccgctgcg ccttatccgg taactatcgt cttgagtcca 5940 

acccggtaag acacgactta tcgccactgg cagcagccac tggtaacagg attagcagag 6000 

cgaggtatgt aggcggtgct acagagttct tgaagtggtg gcctaactac ggctacacta 6060 

gaaggacagt atttggtatc tgcgctctgc tgaagccagt taccttcgga aaaagagttg 6120 

gtagctcttg atccggcaaa caaaccaccg ctggtagcgg tggttttttt gtttgcaagc 6180 

agcagattac gcgcagaaaa aaaggatctc aagaagatcc tttgatcttt tctacggggt 6240 

ctgacgctca gtggaacgaa aactcacgtt aagggatttt ggtcatgaga ttatcaaaaa 6300 

ggatcttcac ctagatcctt ttaaattaaa aatgaagttt taaatcaatc taaagtatat 6360 

atgagtaaac ttggtctgac agttaccaat gcttaatcag tgaggcacct atctcagcga 6420 

tctgtctatt tcgttcatcc atagttgcct gactccccgt cgtgtagata actacgatac 6480 

gggagggctt accatctggc cccagtgctg caatgatacc gcgagaccca cgctcaccgg 6540 

caccggattt atcagcaata aaccagccag ccggaagggc cgagcgcaga agtggtcctg 6600 

caactttatc cgcctccatc cagtctatta attgttgccg ggaagctaga gtaagtagtt 6660 

cgccagttaa tagtttgcgc aacgttgttg ccattgctac aggcatcgtg gtgtcacgct 6720 

cgtcgtttgg tatggcttca ttcagctccg gttcccaacg atcaaggcga gttacatgat 6780 

cccccatgtt gtgcaaaaaa gcggttagct ccttcggtcc tccgatcgtt gtcagaagta 6840 

agttggccgc agtgttatca ctcatggtta tggcagcact gcataattct cttactgtca 6900 

tgccatccgt aagatgcttt tctgtgactg gtgagtactc aaccaagtca ttctgagaat 6960 

agtgtatgcg gcgaccgagt tgctcttgcc cggcgtcaat acgggataat accgcgccac 7020 

atagcagaac tttaaaagtg ctcatcattg gaaaacgttc ttcggggcga aaactctcaa 7080 

ggatcttacc gctgttgaga tccagttcga tgtaacccac tcgtgcaccc aactgatctt 7140 

cagcatcttt tactttcacc agcgtttctg ggtgagcaaa aacaggaagg caaaatgccg 7200 

caaaaaaggg aataagggcg acacggaaat gttgaatact catactcttc ctttttcaat 7260 

attattgaag catttatcag ggttattgtc tcatgagcgg atacatattt gaatgtattt 7320 

agaaaaataa acaaataggg gttccgcgca catttccccg aaaagtgcca cctgacgtct 7380 

aagaaaccat tattatcatg acattaacct ataaaaatag gcgtatcacg aggccctttc 7440 
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<210> 22 
<211> 5718 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> Sequence of MB1510 genomic integration region 
<400> 22 

gagcgccgtt tggctgaatg atacaacagt ctcacttcct tactgcgtct ggttgcaaaa 60 

acgaagaagc aaggattccc ctcgcttctc atttgtccta tttattatac acttttttaa 120 

gcacatcttt ggcgcttgtt tcactagact tgatgcctct gaatcttgtc caagtgtcac 180 

ggtccgcatc atagacttgt ccatttttca ccgctttgag atttttccag agcgggttcg 240 

ttttccactc atctacaatg gttttgcctt cgttggctga gatgaacaaa atatcaggat 300 

cgattttgct caattgctca aggctgacct cttgataggc gttatctgac ttcacagcgt 360 

gtgtaaagcc tagcatttta aagatttctc cgtcatagga tgatgatgta tgaagctgga 420 

aggaatccgc tcttgcaacg ccgagaacga tgttgcggtt ttcatctttc ggaagttcgg 480 

cttttagatc gttgatgact tttttgtgct cggcaagctt ttcttttcct tcatcttctt 540 

tatttaatgc tttagcaatg gtcgtaaagc tgtcgatcgt ttcgtcatat gtcgcttcac 600 

ggctttttaa ttcaatcgtc ggggcgattt ttttcagctg tttataaatg tttttatggc 660 

gctcagcgtc agcgatgatt aaatcaggct tcaaggaact gatgacctca agattgggtt 720 

cgctgcgtgt gcctacagat gtgtaatcaa tggagctgcc gacaagcttt ttaatcatat 780 

cttttttgtt gtcatctgcg atgcccaccg gcgtaatgcc gagattgtga acggcatcca 840 

agaatgaaag ctcaagcaca accacccgct taggtgtgcc gcttactgtc gtttttcctt 900 

cttcgtcatg gatcactctg gaatccttag actcgctttt gccgcttccg ttgttattct 960 

ggcttgatga acagccggat acaatgaggc aggcgagcaa taaaacactc atgatggcaa 1020 

tcaacttgtt agaataggtg cgcatgtcat tcttcctttt ttcagattta gtaatgagaa 1080 

tcattatcac atgtaacact ataatagcat ggcttatcat gtcaatattt ttttagtaaa 1140 

gaaagctgcg tttttactgc tttctcatga aagcatcatc agacacaaat aagtggtatg 1200 

cagcgttacc gtgtcttcga gacaaaaacg catgggcgtt ggctttagag gtttcgaaca 1260 

tatcagcagt gacataagga aggagagtgc tgagataacc ggacaatttc ttttctattt 1320 

catctgttag tgcaaattca atgtcgccga tattcatgat aatcgagaaa acaaagtcga 1380 

tatcgatatg aaaatgttcc tcggcaaaaa ccgcaagctc gtgaattcct ggtgaacatc 1440 

cggcacgctt atggaaaatc tgtttgacta aatcactcac aatccaagca ttgtattgct 1500 

gttctggtga aaagtattgc attagacata cctcctgctc gtacggataa aggcagcgtt 1560 

tcatggtcgt gtgctccgtg cagcggcttc tccttaattt tgatttttct gaaaataggt 1620 

cccgttccta tcactttacc atggacggaa aacaaatagc tactaccatt cctcctgttt 1680 

ttctcttcaa tgttctggaa tctgtttcag gtacagacga tcgggtatga aagaaatata 1740 
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gaaaacatga aggaggaata tcgacatgaa accagttgta aaagagtata caaatgacga 1800 

acagctcatg aaagatgtag aggaattgca gaaaatgggt gttgcgaaag aggatgtata i860 

cgtcttagct cacgacgatg acagaacgga acgcctggct gacaacacga acgccaacac 1920 

gatcggagcc aaagaaacag gtttcaagca cgcggtggga aatatcttca ataaaaaagg 1980 

agacgagctc cgcaataaaa ttcacgaaat cggtttttct gaagatgaag ccgctcaatt 2040 

tgaaaaacgc ttagatgaag gaaaagtgct tctctttgtg acagataacg aaaaagtgaa 2100 

agcttgggca taaagcaagg aaaaaaccaa aaggccaatg tcggcctttt ggtttttttg 2160 

cggtctttgc ggtgggattt tgcagaatgc cgcaatagga tagcggaaca ttttcggttc 2220 

tgaatgtccc tcaatttgct attatatttt tgtgataaat tggaataaaa tctcacaaaa 2280 

tagaaaatgg gggtacatag tggatgaaaa aagtgatgtt agctacggct ttgtttttag 2340 

gattgactcc agctggcgcg aacgcagctg atttaggcca ccagacgttg ggatccaatg 2400 

atggctgggg cgcgtactcg accggcacga caggcggatc aaaagcatcc tcctcaaatg 2460 

tgtataccgt cagcaacaga aaccagcttg tctcggcatt agggaaggaa acgaacacaa 2520 

cgccaaaaat catttatatc aagggaacga ttgacatgaa cgtggatgac aatctgaagc 2580 

cgcttggcct aaatgactat aaagatccgg agtatgattt ggacaaatat ttgaaagcct 2640 

atgatcctag cacatggggc aaaaaagagc cgtcgggaac acaagaagaa gcgagagcac 2700 

gctctcagaa aaaccaaaaa gcacgggtca tggtggatat ccctgcaaac acgacgatcg 2760 

tcggttcagg gactaacgct aaagtcgtgg gaggaaactt ccaaatcaag agtgataacg 2820 
tcattattcg caacattgaa ttccaggatg cctatgacta ttttccgcaa tggttgtaaa 



2880 



acgacggcca gtgaattctg atcaaatggt tcagtgagag cgaagcgaac acttgatttt 2940 

ttaattttct atcttttata ggtcattaga gtatacttat ttgtcctata aactatttag 3000 

cagcataata gatttattga ataggtcatt taagttgagc atattagagg aggaaaatct 3060 

tggagaaata tttgaagaac ccgagatcta gatcaggtac cgcaacgttc gcagatgctg 3120 

ctgaagagat tattaaaaag ctgaaagcaa aaggctatca attggtaact gtatctcagc 3180 

ttgaagaagt gaagaagcag agaggctatt gaataaatga gtagaaagcg ccatatcggc 3240 

gcttttcttt tggaagaaaa tatagggaaa atggtacttg ttaaaaattc ggaatattta 3300 

tacaatatca tatgtatcac attgaaagga ggggcctgct gtccagactg tccgctgtgt 3360 

aaaaataagg aataaagggg ggttgacatt attttactga tatgtataat ataatttgta 3420 

taagaaaatg gaggggccct cgaaacgtaa gatgaaacct tagataaaag tgcttttttt 3480 

gttgcaattg aagaattatt aatgttaagc ttaattaaag ataatatctt tgaattgtaa 3540 

cgcccctcaa aagtaagaac tacaaaaaaa gaatacgtta tatagaaata tgtttgaacc 3600 

ttcttcagat tacaaatata ttcggacgga ctctacctca aatgcttatc taactataga 3660 

atgacataca agcacaacct tgaaaatttg aaaatataac taccaatgaa cttgttcatg 3720 

tgaattatcg ctgtatttaa ttttctcaat tcaatatata atatgccaat acattgttac 3780 
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aagtagaaat taagacaccc ttgatagcct tactatacct aacatgatgt agtattaaat 3840 

gaatatgtaa atatatttat gataagaagc gacttattta taatcattac atatttttct 3900 

attggaatga ttaagattcc aatagaatag tgtataaatt atttatcttg aaaggaggga 3960 

tgcctaaaaa cgaagaacat taaaaacata tatttgcacc gtctaatgga tttatgaaaa 4020 

atcattttat cagtttgaaa attatgtatt atggagctct gaaaaaaagg agaggataaa 4080 

gagaaaaggg gatcggaaaa caagtatata ggaggagacc. tatttatggc ttcagaaaaa 4140 

gacgcaggaa aacagtcagc agtaaagctt gttccattgc ttattactgt cgctgtggga 4200 

ctaatcatct ggtttattcc cgctccgtcc ggacttgaac ctaaagcttg gcatttgttt 4260 

gcgatttttg tcgcaacaat tatcggcttt atctccaagc ccttgccaat gggtgcaatt 4320 

gcaatttttg cattggcggt tactgcacta actggaacac tatcaattga ggatacatta 4380 

agcggattcg ggaataagac catttggctt atcgttatcg cattctttat ttcccgggga 4440 

tttatcaaaa ccggtctcgg tgcgagaatt tcgtatgtat tcgttcagaa attcggaaaa 4500 

aaaacccttg gactttctta ttcactgcta ttcagtgatt taatactttc acctgctatt 4560 

ccaagtaata cggcgcgtgc aggaggcatt atatttccta ttatcagatc attatccgaa 4620 

acattcggat caagcccggc aaatggaaca gagagaaaaa tcggtgcatt cttattaaaa 4680 

accggttttc aggggaatct gatcacatct gctatgttcc tgacagcgat ggcggcgaac 4740 

ccgctgattg ccaagctggc ccatgatgtc gcaggggtgg acttaacatg gacaagctgg 4800 

gcaattgccg cgattgtacc gggacttgta agcttaatca tcacgccgct tgtgatttac 4860 

aaactgtatc cgccggaaat caaagaaaca ccggatgcgg cgaaaatcgc aacagaaaaa 4920 

ctgaaagaaa tgggaccgtt caaaaaatcg gagctttcca tggttatcgt gtttcttttg 4980 

gtgcttgtgc tgtggatttt tggcggcagc ttcaacatcg acgctaccac aaccgcattg 5040 

atcggtttgg ccgttctctt attatcacaa gttctgactt gggatgatat caagaaagaa 5100 

cagggcgctt gggatacgct cacttggttt gcggcgcttg tcatgctcgc caacttcttg 5160 

aatgaattag gcatggtgtc ttggttcagt aatgccatga aatcatccgt atcagggttc 5220 

tcttggattg tggcattcat cattttaatt gttgtgtatt attactctca ctatttcttt 5280 

gcaagtgcga cagcccacat cagtgcgatg tattcagcat ttttggctgt cgtcgtggca 5340 

gcgggcgcac cgccgctttt agcagcgctg agcctcgcgt tcatcagcaa cctgttcggg 5400 

tcaacgactc actacggttc tggagcggct ccggtcttct tcggagcagg ctacatcccg 5460 

caaggcaaat ggtggtccat cggatttatc ctgtcgattg ttcatatcat cgtatggctt 5520 

gtgatcggcg gattatggtg gaaagtacta ggaatatggt agaaagaaaa aggcagacgc 5580 

ggtctgcctt tttttatttt cactccttcg taagaaaatg gattttgaaa aatgagaaaa 5640 

ttccctgtga aaaatggtat gatctaggta gaaaggacgg ctggtgctgt ggtgaaaaag 5700 
cggttccatt tttccctg 
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<210> 23 10423.010-DK.ST2S.txt 

<211> 27 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Primer 1605 

<400> 23 

gacggccagt gaattcgata aaagtgc 

<210> 24 - 

<211> 42 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Primer 1606 
<220> 

< 221> misc_feature 

<222> (13)..C13) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> C16)..C16D 

<223> n is a, c, g, or t 

<400> 24 

ccagatctct atnktnktgt acggagtcta actccccaag ag 

<210> 25 
<211> 1U2 
<212> DNA 

<213> Nocardiopsis dassonvillei DSM 43235 
<400> 25 

gcttttagtt catcgatcgc atcggctgct ccggcccccg tcccccagac ccccgtcgcc 
gacgacagcg ccgccagcat gaccgaggcg ctcaagcgcg acctcgacct cacctcggcc 
gaggccgagg agcttctctc ggcgcaggaa gccgccatcg agaccgacgc cgaggccacc 
gaggccgcgg gcgaggccta cggcggctca ctgttcgaca ccgagaccct cgaactcacc 
gtgctggtca ccgacgcctc cgccgtcgag gcggtcgagg ccaccggagc ccaggccacc 
gtcgtctccc acggcaccga gggcctgacc gaggtcgtgg aggacctcaa cggcgccgag 360 
gttcccgaga gcgtcctcgg ctggtacccg gacgtggaga gcgacaccgt cgtggtcgag 
gtgctggagg gctccgacgc cgacgtcgcc gccctgctcg ccgacgccgg tgtggactcc 
tcctcggtcc gggtggagga ggccgaggag gccccgcagg tctacgccga catcatcggc 
ggcctggcct actacatggg cggccgctgc tccgtcggct tcgccgcgac caacagcgcc 
ggtcagcccg gtttcgtcac cgccggccac tgcggcaccg tcggcaccgg cgtgaccatc 
ggcaacggca ccggcacctt ccagaactcg gtcttccccg gcaacgacgc cgccttcgtc 
cgcggcacct ccaacttcac cctgaccaac ctggtctcgc gctacaactc cggcggctac 
cagtcggtga ccggtaccag ccaggccccg gccggctcgg ccgtgtgccg ctccggctcc 
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accaccggct ggcactgcgg caccatccag gcccgcaacc agaccgtgcg ctacccgcag 900 

ggcaccgtct actcgctcac ccgcaccaac gtgtgcgccg agcccggcga ctccggcggt 960 

tcgttcatct ccggctcgca ggcccagggc gtcacctccg gcggctccgg caactgctcc 1020 

Qtcggcggca cgacctacta ccaggaggtc accccgatga tcaactcctg gggtgtcagg 1080 

atccggacct aatcgcatgt tcaatccgct cc 1112 

<210> 26 

<211> 48 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Primer 1423 

<400> 26 

gcttttagtt catcgatcgc atcggctgct ccggcccccg tcccccag 48 

<210> 27 

<211> 45 

<212> DNA 

<213:> Artificial sequence 
<220> 

<223> Primer 1475 ^ 

<400> 27 

ggagcggatt gaacatgcga ttaggtccgg atcctgacac cccag 45 

<210> 28 

<211> 354 

<212> PRT 

<213> Nocardiopsis dassonvillei DSM 43235 
<220> 

<221> propep ' 

<222> (1) . . (166) 

<220> 

<221> mat .peptide 
<222> (167). .(354) 

<400> 28 

Ala Pro Ala Pro Val Pro Gin Thr Pro val Ala Asp Asp Ser Ala 
AD:> -160 ~i55 

Ala ser Met Thr Glu Ala Leu Lys Arg Asp Leu Asp Leu Thr ser 
x:>u -145 -140 

Ala Glu Ala Glu Glu Leu Leu ser Ala Gin Glu Ala Ala He Glu 
■ L3 ^ -130 -125 

Thr Asp Ala Glu Ala Thr Glu Ala Ala Gly Glu Ala Tyr Gly Gly 
u -115 -110 
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ser Leu 5 Phe Asp Thr Glu ^^A^tf*^ Val Thr Asp 

Ala ser Ala val Glu Ala Val clu Ala Thr Gly Ala Gin Ala Thr val 
w -85 -80 -75 

. Val ser His Gly Thr Glu Gly Leu Thr Glu Val val Glu Asp Leu Asn 
" 70 -65 -60 

Gly Ala Glu val Pro Glu ser val Leu Gly Trp Tyr Pro Asp val Glu 
~- >:> -50 -45 

Ser Asp Thr val val val Glu val Leu Glu Gly ser Asp Ala Asp val 
~ HV -35 -30 

Ala Ala Leu Leu Ala Asp Ala Gly val Asp ser ser Ser val Arg val 

Glu Glu Ala Glu Glu Ala Pro Gin val Tyr Ala Asp lie lie Gly Gly 

~1 1 5 

Leu Ala Tyr Tyr Met Gly Gly Arg Cys Ser Val Gly Phe Ala Ala Thr 
xu 15 20 

Asn ser Ala Gly Gin Pro Gly Phe Val Thr Ala Gly His Cys Gly Thr 

30 35 

val Gly Thr Gly val Thr lie Gly Asn Gly Thr Gly Thr Phe Gin Asn 

ser val Phe Pro Gly Asn Asp Ala Ala Phe val Arg Gly Thr Ser Asn 
J ou 65 70 

Phe Thr Leu Thr A|n Leu val ser Arg Tgr Asn ser Gly Gly T^r Gin 

ser val Thr Gly Thr ser Gin Ala Pro Ala Gly ser Ala val cys Arg 
90 95 100 

ser Gly ser Thr Thr Gly Trp His cys Gly Thr He Gin Ala Arg Asn 

110 115 

Gin Thr val Arg Tyr pro Gin Gly Thr val Tyr Ser Leu Thr Arg Thr 
x * u 125 130 

Asn val cys Ala Glu Pro Gly Asp Ser Gly Gly ser Phe He Ser Gly 
i:,:> 14( > 145 150 

ser Gin Ala Gin Gly val Thr ser Gly Gly ser Gly Asn Cys Ser val 
155 160 165 
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Gly Gly Thr Thr Ty r Tyr Gin Gl^V^Tr^eV^e As- ser Trp 

175 180 

Gly Val Arg lie Arg Thr 
185 

<210> 29 
<211> 498 
<212> ONA 

<213> Nocardiopsis dassonvillei DSM 43235 
<400> 29 

gctccggccc ccgtccccca gacccccgtc gccgacgaca gcgccgccag catgaccgag 60 
gcgctcaagc gcgacctcga cctcacctcg gccgaggccg . aggagcttct ctcggcgcag 120 
gaagccgcca tcgagaccga cgccgaggcc accgaggccg cgggcgaggc ctacggcggc 180 
tcactgttcg acaccgagac cctcgaactc accgtgctgg tcaccgacgc ctccgccgtc 240 
gaggcggtcg aggccaccgg agcccaggcc accgtcgtct cccacggcac cgagggcctg 300 
accgaggtcg tggaggacct caacggcgcc gaggttcccg agagcgtcct cggctggtac 360 
ccggacgtgg agagcgacac cgtcgtggtc gaggtgctgg agggctccga cgccgacgtc 420 
gccgccctgc tcgccgacgc cggtgtggac tcctcctcgg tccgggtgga ggaggccgag 480 
gaggccccgc aggtctac 498 

<210> 30 

<211> 166 

<212> prt 

<213> Nocardiopsis dassonvillei DSM 43235 

<400> 30 

Ala Pro Ala Pro val Pro Gin Thr Pro val Ala Asp Asp ser Ala Ala 
1 5 10 15 

Ser Met Thr Glu Ala Leu Lys Arg Asp Leu Asp Leu Thr Ser Ala Glu 
20 25 30 

Ala Glu Glu Leu Leu ser Ala Gin Glu Ala Ala lie Glu Thr Asp Ala 
35 40 45 

Glu Ala Thr Glu Ala Ala Gly Glu Ala Tyr Gly Gly ser Leu Phe Asp 

Thr Glu Thr Leu. Glu Leu Thr val Leu val Thr Asp Ala ser Ala val 
05 70 75 80 

Glu Ala val Glu Ala Thr Gly Ala Gin Ala Thr Val val Ser His Gly 
©a 90 95 

Thr Glu Gly Leu Thr Glu val val Glu Asp Leu Asn Gly Ala Glu Val 
luo 105 110 . 
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Pro Glu ffl val Leu Gly Trp J^^fffi-g Asp Thr Val 
val val Glu val Leu Glu Gly ser Asp Ala Asp val Ala Ala Leu Leu 

Ala Asp Ala Gly val Asp sen ser ser val Arg val Glu Glu Ala Glu 

155 160 

Glu Ala Pro Gin val Tyr 
165 

<210> 31 
<211> 1146 
<212> ONA 

<213> Artificial sequence 
<220> 

<223> in%?a^ S ?2 U SI! 8 E? d D ?g t S" the nro-region of seq id no: 29 fused 
construct? 10R(J?ill§i8j2) 6 taiT " variant encoding gene; whole 

<400> 31 

atgaagaaac cgttggggaa aattgtcgca agcaccgcac tactcatttc tgttgctttt 60 

agttcatcga tcgcatcggc tgctccggcc cccgtccccc agacccccgt cgccgacgac 120 

agcgccgcca gcatgaccga ggcgctcaag cgcgacctcg acctcacctc ggccgaggcc 180 

gaggagcttc tctcggcgca ggaagccgcc atcgagaccg acgccgaggc caccgaggcc 240 

gcgggcgagg cctacggcgg ctcactgttc gacaccgaga ccctcgaact caccgtgctg 300 

gtcaccgacg cctccgccgt cgaggcggtc gaggccaccg gagcccaggc caccgtcgtc 360 

tcccacggca ccgagggcct gaccgaggtc gtggaggacc tcaacggcgc cgaggttccc 420 

gagagcgtcc tcggctggta cccggacgtg gagagcgaca ccgtcgtggt cgaggtgctg 480 

gagggctccg acgccgacgt cgccgccctg ctcgccgacg ccggtgtgga ctcctcctcg 540 

gtccgggtgg aggaggccga ggaggccccg caggtctatg ccgatatcat tggaggccta 600 

gcgtacacaa tgggtggtcg ctgcagcgta ggatttgcag ccacaaatgc agctggacaa • 660 

cctggcttcg tgacagctgg acattgcggc cgcgtcggta cacaggttac tatcggcaat 720 

ggaagaggtg tctttgagca aagcgtattt cccgggaatg atgctgcctt cgttagaggt 780 

acgtccaact ttacgcttac taacttagta tctagataca acactggcgg atatgcaact 840 

gtagcaggtc acaatcaagc acctattggc tctagcgtct gccgctcagg gtcgactaca 900 

ggatggcatt gtggaaccat tcaagctaga ggtcagagcg tgagctatcc tgaaggtacc 960 

gtaacgaaca tgactcgtac gactgtatgt gcagaaccag gtgactctgg aggttcatat 1020 

atcagcggta cgcaagcgca aggcgttacc tcaggtggat ccggtaactg taggacaggt 1080 

ggcacaacgt tctaccagga agtgacaccg atggtgaact cttggggagt tagactccgt 1140 
acataa 

1146 
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<210> 32 10423.010-DK.ST25.txt 
<211> 1068 
<212> DNA 

<213> Nocardiopsis Alba dsm 15647 
<400> 32 

gcgaccggcc ccctccccca gtcccccacc ccggatgaag ccgaggccae caccatggtc 60 

gaggccctcc agcgcgacct cggcctgtcc ccctctcagg ccgacgagct cctcgaggcg 120 

caggccgagt ccttcgagat cgacgaggcc gccaecgcgg ccgcagccga ctcctacggc 180 

ggctccatct tcgacaccga cagcctcacc ctgaccgtcc tggtcaccga cgcctccgcc 240 

gtcgaggcgg tcgaggccgc cggcgccgag gccaaggtgg tctcgcacgg catggagggc 300 

ctggaggaga tcgtcgccga cctgaacgcg gccgacgctc agcccggcgt cgtgggctgg 360 

taccccgaca tccactccga cacggtcgtc ctcgaggtcc tcgagggctc cggtgccgac 420 

gtggactccc tgctcgccga cgccggtgtg gacaccgccg acgtcaaggt ggagagcacc 480 

accgagcagc ccgagctgta cgccgacatc atcggcggtc tcgcctacac catgggtggg 540 

cgctgctcgg tcggcttcgc ggccaccaac gcctccggcc agcccgggtt cgtcaccgcc 600 

ggccactgcg gcaccgtcgg caccccggtc agcatcggca acggccaggg cgtcttcgag 660 

cgttccgtct tccccggcaa cgactccgcc ttcgtccgcg gcacctcgaa cttcaccctg 720 

accaacctgg tcagccgcta caacaccggt ggttacgcga ccgtctccgg ctcctcgcag 780 

gcggcgatcg gctcgcagat ctgccgttcc ggctccacca ccggctggca ctgcggcacc 840 

gtccaggccc gcggccagac ggtgagctac ccccagggca ccgtgcagaa cctgacccgc 900 

accaacgtct gcgccgagcc cggtgactcc ggcggctcct tcatctccgg cagccaggcc 960 

cagggcgtca cctccggtgg ctccggcaac tgctccttcg gtggcaccac ctactaccag 1020 

gaggtcaacc cgatgctgag cagctggggt ctgaccctgc gcacctga 1068 

<210> 33 

<211> 355 

<212> PRT" 

<213>. Nocardiopsis Alba OSM 15647 
<220> 

<221> PROPEP 

<222> CD . . (167) 

<220> 

<221> mat_pepti de 
<222> (168>.. (355) 

<400> 33 

Ala Thr Gly^ Pro Leu Pro Gin ser^ Pro Thr Pro Asp Glu^ Ala Glu 
Ala Thr Thr Q Met val Glu Ala Leu Gin Arg Asp Leu Gly Leu Ser 



-140 



Pro ser Gin Ala Asp Glu Leu Leu Glu Ala Gin Ala Glu ser Phe 

Page 18 



-135 10423. OlO-DK.STZS.txt^^ 

Glu lie Asp Glu Ala Ala Tbr Ala Ala Ala Ala Asp Ser Tyr Gly 
A£U -115 -110 

Gly ser lie Phe Asp Thr Asp ser Leu Thr Leu Thr val Leu Val Th 
XUD -100 -95 

Asp Ala ser Ala val Glu Ala val Glu Ala Ala Gly Ala Glu Ala Lys 

-« -80 

val val ser His Gly Met Glu Gly Leu Glu Glu He val Ala Asp Leu 

~ 70 -65 -60 

Asn Ala Ala Asp Ala Gin Pro Gly val val Gly Trp Tyr Pro Asp He 
"55 -50 _ 4 | 

His ser Asp Thr Val val Leu Glu val Leu Glu Gly ser Gly Ala Asp 

-35 -30 

val Asp ser Leu Leu Ala Asp Ala Gly val Asp Thr a!| Asp Val Lys 

val Glu ser Thr Thr Glu Gin Pro Glu Leu Tyr Ala Asp lie He Gly 

Gly Leu Ala Tyr Thr Met Gly Gly Arg cys ser val Gly Phe Ala Ala 
10 15 20 

Thr Asn Ala ser Gly Gin Pro Gly Phe val Thr Ala Gly His cys Gly 

Thr val Gly Thr Pro val ser lie Gly Asn Gly Gin Gly val Phe Glu 

Arg ser val Phe Pro Gly Asn Asp ser Ala Phe val Arg Gly Thr ser 

60 55 

Asn Phe Thr Leu Thr Asn Leu val ser Arg Tyr Asn Thr Gly Gly Tyr 

75 80 85 

Ala Thr val Ser Gly ser Ser Gin Ala Ala lie Gly ser Gin lie Cys 
90 .95 100 

Arg ser Gly ser Thr Thr Gly Trp His cys Gly Thr val Gin Ala Arg 

110 H5 

Gly Gin Thr val ser Tyr Pro Gin Gly Thr val Gin Asn Leu Thr Arg 

Thr ash val cys Ala Glu Pro Gly Asp ser Gly Gly ser Phe He Ser 
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-.or ... 10423.010-DK.ST25.txt 

13 •> 140 145 

Gly ser Gin Ala Gin Gl| val Thr Ser Gly Gly ser Gly Asn cys ser . 

Phe Gly Gly Thr Thr Tyr Tyr Gin Glu Val Asn Pro Met Leu ser ser 
A/u 175 180 

Trp Gly Leu Thr Leu Arg Thr 
185 

<210> 34 
<211> 43 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> Primer 1421 
<400> 34. 

gttcatcgat cgcatcggct gcgaccggcc ccctccccca gtc 43 

<210> 35 
<211> 31 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Primer 1604 



<400> 35 

gcggatccta tcaggtgcgc agggtcagac c 



31 



<210> 36 
<211> 1062 
<212> DNA 

<213> Nocardiopsis prasina dsm 15648 
<400> 36 

gccaccggac cgctccccca gtcacccacc ccggaggccg acgccgtctc catgcaggag 60 

gcgctccagc gcgacctcgg cctgaccccg cttgaggccg atgaactgct ggccgcccag 120 

gacaccgcct tcgaggtcga cgaggccgcg gccgcggccg ccggggacgc ctacggcggc 180 

tccgtcttcg acaccgagac cctggaactg accgtcctgg tcaccgacgc cgcctcggtc 240 

gaggctgtgg aggccaccgg cgcgggtacc gaactcgtct cctacggcat cgagggcctc 300 

gacgagatca tccaggatct caacgccgcc gacgccgtcc ccggcgtggt cggctggtac 360 

ccggacgtgg cgggtgacac cgtcgtcctg gaggtcctgg agggttccgg agccgacgtg 420 

agcggcctgc tcgccgacgc cggcgtggac gcctcggccg tcgaggtgac cagcagtgcg 480 

cagcccgagc tctacgccga catcatcggc ggtctggcct acaccatggg cggccgctgt 540 

tcggtcggat tcgcggccac caacgccgcc ggtcagcccg gattcgtcac cgccggtcac 600 

tgtggccgcg tgggcaccca ggtgagcatc ggcaacggcc agggcgtctt cgagcagtcc 660 

atcttcccgg gcaacgacgc cgccttcgtc cgcggcacgt ccaacttcac gctgaccaac 720 
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1062 



10423.010-DK.ST25.txt 

ctggtcagcc gctacaacac cggcggttac gccaccgtcg ccggccacaa ccaggcgccc 780 

atcggctcct ccgtctgccg ctccggctcc accaccggct ggcactgcgg caccatccag 840 

gcccgcggcc agtcggtgag ctaccccgag ggcaccgtca ccaacatgac ccggaccacc 90 0 

gtgtgcgccg agcccggcga ctccggcggc tcctacatct ccggcaacca ggcccagggc 960 

gtcacctccg gcggctccgg caactgccgc accggcggga ccaccttcta ccaggaggtc 1020 
acccccatgg tgaactcctg gggcgtccgt ctccggacct aa 

<210> 37 

<2ll> 353 

<212> . PRT 

<213> Nocardiopsis prasina OSM 15648 
<220> 

<221> PROPEP 

<222> (1) . . (165) 

<220> 

<221> mat_peptide 
<222> (166).. (353) 

<400> 37 

-165 Gly Pr ° L6U ?5So Gln Ser Pro ™r PJ° 5 Glu Ala Asp Ala 

-i5o Ser Met Gln Glu t^s Leu Gln Arg Asp IrfJo Gly Leu Thr Pro 

-135 GlU Ala ASP Gl " bSo L6U Ala Ala Gl " A ^ 5 Thr Ala Phe Glu 

-i2o Asp Glu Ala Ala tll s Ala Ala Ala Gly tl% Ala Tyr Glv Gl y 

?105 Val PHe ASP Thr G ]&> Thr Leu Glu Leu Thr val >-eu val Thr Asp 

uu -95 -90 

Ala Ala ser val Glu Ala Val Glu Ala Thr Gly Ala Gly Thr Glu Leu 

val ser Tyr Gly He Glu Gly Leu Asp Glu lie lie Gln Asp Leu Asn 

-65 -60 

Ala Ala as| Ala val Pro Gly val val Gly Trp Tyr Pro Asp val Ala 

-:>u -45 

Gly Asp Thr val val Leu Glu val Leu Glu Gly ser Gly Ala Asp val 

ser Gly Leu Leu Ala Asp Ala Gly val Asp Ala ser Ala Val Glu val 

"15 _ 10 
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10423.010-DK.ST25.txt 
Thr Ser ser Ala Gin Pro Glu Leu Tyr Ala Asp He lie Gly Gly Leu 

Ala Tyr Thr wet Gly Gly Arg cys ser val Gly Phe Ala Ala Thr Asn 

15 20 . 

Ala Ala Gly Gin Pro Gly Phe val Thr Ala Gly His Cys Gly Arg Val 

Gly Thr Gin val ser lie Gly Asn Gly Gin Gly val Phe Glu Gin ser 

50 55 

He Phe Pro Gly Asn Asp Ala Ala Phe Val Arg Gly Thr Ser Asn Phe 
ou 65 70 

Thr Leu Thr Asn Leu Val Ser Arg Tyr Asn Thr Gly Gly Tyr Ala Thr 
•* 80 85 

val Ala Gly His Asn Gin Ala Pro lie Gly ser ser val cys Arg ser 

95 100 

Gly ser Thr Thr Gly Trp His Cys Gly Thr He Gin Ala Arg Gly Gin 

ser val ser Tyr Pro Glu Gly Thr Val Thr Asn Met Thr Arg Thr Thr 

125 130 - 135 

val cys Ala Glu Pro Gly Asp ser Gly Gly ser Tyr He ser Gly Asn 
±w 145 150 

Gin Ala Gin Gly val Thr Ser Gly Gly ser Gly Asn cys Ara Thr Gly 

Gly Thr Thr Phe fyr Gin Glu val Thr Pro Met val Asn ser Trp Gly 
1/0 175 180 

val Arg Leu Arg Thr 
185 

<210> 38 

<211> 43 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Primer 1346 

<400> 38 

gttcatcgat cgcatcggct gccaccggac cgctccccca gtc 



<210> 39 
<211> 38 
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<212> DNA 10423.010-DK.ST25.txt 

<213> Artificial sequence 

<220> 

<223> primer 1602 
<400> 39 

gcggatccta ttaggtccgg agacggacgc cccaggag 38 

<210> 40 
<211> 1062 
<212> DNA . 

<213> Nocardiopsis prasina dsm 15649 
<400> 40 

gccaccggac cactccccca gtcacccacc ccggaggccg acgccgtctc catgcaggag 60 

gcgctccagc gcgacctcgg cctgaccccg cttgaggccg atgaactgct ggccgcccag 120 

gacaccgcct tcgaggtcga cgaggccgcg gccgaggccg ccggtgacgc ctacggcggc 180 

tccgtcttcg acaccgagac cctggaactg accgtcctgg tcaccgactc cgccgcggtc 240 

gaggcggtgg aggccaccgg cgccgggacc gaactggtct cctacggcat cacgggcctc 300 

gacgagatcg tcgaggagct caacgccgcc gacgccgttc ccggcgtggt cggctggtac 360 

ccggacgtcg cgggtgacac cgtcgtgctg gaggtcctgg agggttccgg cgccgacgtg 420 

ggcggcctgc tcgccgacgc cggcgtggac gcctcggcgg tcgaggtgac caccaccgag 480 

cagcccgagc tgtacgccga catcatcggc ggtctggcct acaccatggg cggccgctgt S40 

tcggtcggct tcgcggccac caacgccgcc ggtcagcccg ggttcgtcac cgccggtcac 600 

tgtggccgcg tgggcaccca ggtgaccatc ggcaacggcc ggggcgtctt cgagcagtcc 660 

atcttcccgg gcaacgacgc cgccttcgtc cgcggaacgt ccaacttcac gctgaccaac 720 

ctggtcagcc gctacaacac cggcggctac gccaccgtcg ccggtcacaa ccaggcgccc 780 

atcggctcct ccgtctgccg ctccggctcc accaccggtt ggcactgcgg caccatccag 840 

gcccgcggcc agtcggtgag ctaccccgag ggcaccgtca ccaacatgac gcggaccacc 900 

gtgtgcgccg agcccggcga ctccggcggc tcctacatct ccggcaacca ggcccagggc 960 

gtcacctccg gcggctccgg caactgccgc accggcggga ccaccttcta ccaggaggtc 1020 

acccccatgg tgaactcctg gggcgtccgt ctccggacct aa 1062 

<210> 41 

<211> 353 

<212> PRT 

<213> Nocardiopsis prasina DSM 15649 
<220> 

<221> PROPEP 

<222> (1) . . C165) 

<220> 

<221> mat neptide 
<222> (166).. (3 53) 

<400> 41 
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10423 . 010-DK . ST2 5 . txt 
-165 -160 Gl " Pr ° ThP P i55 61 U Ala ASP Ala 

val ser Met Gin Glu Ala Leu Gin Arg Asp Leu Gly Leu Thr Pro 
" 1:>0 -145 -140 

Leu^ Glu Ala Asp Glu Leu^ Leu Ala Ala Gin Asp Thr Ala Phe Glu 



-135 - iiio " " 

Val Asp Glu Ala Ala Ala Glu Ala Ala Gly Asp Ala Tyr Gly Gly 
«° -115 -Hi) 

Ser val Phe Asp Thr Glu Thr Leu Glu Leu Thr val Leu Val Thr Asp 
"■ LUi -100 -95 -go 

Ser Ala Ala val Glu Ala val Glu Ala Thr Gly Ala Gly Thr Glu Leu 
-85 -80 -75 

val ser Tyr Gly lie Thr Gly Leu Asp Glu He val Glu Glu Leu Asn 
"70 -65 -60 

Ala Ala Asp Ala Val Pro Gly val Val Gly Trp Tyr Pro Asp val Ala 
-55 -50 -45 

Gly Asp Thr val val Leu Glu val Leu Glu Gly ser Gly Ala Asp val 
-W -35 _3q 

Gly Gly Leu Leu Ala Asp Ala Gly val Asp Ala ser Ala val Glu val 
~" -20 -15 _io 

Thr Thr Thr Glu Gin Pro Glu Leu T^r Ala Asp He lie Gly Gly Leu 

Ala Tyr Thr Met Gly Gly Arg Cys Ser Val Gly Phe Ala Ala Thr Asn 
10 15 20 

Ala Ala Gly Gin Pro Gly Phe Val Thr Ala Gly His cys Gly Arg val 
*5 30 35 

Gly Thr Gin val Thr lie Gly Asn Gly Arg Gly val Phe Glu Gin Ser 
*»« 45 50 55 

lie Phe Pro Gly Asn Asp Ala Ala Phe val Arg Gly Thr Ser Asn Phe 
60 65 70 

Thr Leu Thr Asn Leu val Ser Arg Tyr Asn Thr Gly Gly Tyr Ala Thr 
75 80 85 

val Ala Gly His Asn Gin Ala Pro lie Gly ser ser val cys Arg ser 
90 95 100 
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10423 . 010-DK . ST25 . txt 
Gly Ser Thr Thr Gly Trp His cys Gly Thr He Gin Ala Arg Gly Gin 

Ser val Ser Tyr Pro Glu Gly Thr val Thr Asn Met Thr Arg Thr Thr 

125 130 " 135 

val cys Ala Glu Pro Gly Asp Ser Gly Gly ser Tyr He Ser Gly Asn 
140 145 150 

Gin Ala Gin Gly val Thr Ser Gly Gly ser Gly Asn cys Arg Thr Gly 
155 160 165 

Gly Thr Thr Phe Tyr Gin Glu val Thr Pro Met val Asn Ser Trp Gly 
170 175 180 

val Arg Leu Arg Thr 
185 

<210> 42 

<211> 43 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Primer 1603 

<4Q0> 42 

gttcatcgat cgcatcggct gccaccggac cactccccca gtc 

<210> 43 

<211> 353 

<212> prt 

<213> Nocardiopsis sp. nrrl 18262 
<220> 

<221> PROPEP 

<222> CD . . (165) 

<220> 

<221> mat_peptide 
<222> (166).. (1059) 

<400> 43 

Ala Thr Gly Ala Leu Pro Gin Ser Pro Thr Pro Glu Ala Asp Ala 
-">•> -160 -155 

val ser Met Gin Glu Ala Leu Gin Arg Asp Leu Asp Leu Thr Ser 
A:>u -145 -140 

Al| s Glu Ala Glu Glu Leu o Leu Ala Ala Gin Asp^ Thr Ala Phe Glu 

-120 ASP Glu Ala Ala A l? c Glu Ala Ala G_| y AS P Ala Tyr Gly Gly 
XiKJ -115 -HO 
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10423 . 010-DK . ST25 . txt 

ser val Phe Asp Thr Glu ser Leu Glu Leu Thr Val Leu val Thr Asp 
xui -100 _95 _9q 

Ala Ala Ala val Glu Ala Val Glu Ala Thr Gly Ala Gly Thr Glu Leu 
-85 -80 -75 

val ser Tyr Gly He Asp Gly Leu Asp Glu He val Gin Glu Leu Asn 
" 70 -65 -60 

Ala Ala Asp Ala val Pro Gly val val Gly Trp Tyr pro Asp val Ala 
-55 -50 _45 

Gly Asp Thr val val Leu Glu val Leu Glu Gly ser Gly Ala Asp val 
-40 -35 -30 

Ser Gly Leu Leu Ala Asp Ala Gly val Asp Ala Ser Ala Val Glu Val 
~" -20 -15 _io 

Thr Thr ser Asp Gin Pro Glu Leu T^r Ala Asp lie lie. Gly Gly Leu 

Ala Tyr Thr Met Gly Gly Arg cys ser val Gly Phe Ala Ala Thr Asn 
10 , 15 20 

Ala Ala Gly Gin pro Gly Phe val Thr Ala Gly His cys Gly Arg val 
25 30 35 

Gly Thr Gin Val Thr lie Gly Asn Gly Arg Gly Val phe Glu Gin Ser 
<» u 45 50 55 

val Phe Pro Gly Asn Asp Ala Ala phe val Arg Gly Thr ser Asn Phe 
60 65 70 

Thr Leu Thr Asn Leu val Ser Arg Tyr Asn Thr Gly Gly Tyr Ala Thr 
75 80 85 

Val Ala Gly His Asn Gin Ala Pro lie Gly ser Ser val Cys Ara ser 
90 95 100 

Gly ser Thr Thr Gly Trp His Cys Gly Thr lie Gin Ala Arg Gly Gin 
105 110 115 

Ser val Ser Tyr Pro Glu Gly Thr Val Thr Asn Met Thr Arg Thr Thr 

125 130 " 135 

Val cys Ala Glu Pro Gly Asp ser Gly Gly ser Tyr lie Ser Gly Thr 
14 0 145 150 

Gin Ala Gin Gly val Thr Ser Gly Gly Ser Gly Asn Cys Arg Thr Gly 

160 165 
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10423 . 010-DK. ST25 . txt 

Gly Thr Thr Phe Tyr Gin Glu Val Thr Pro Met val Asn ser Tro Glv 
170 175 180 

val Arg Leu Arg Thr 
185 



Page 27 



